Abstract

This course presents an introduction to some problems of current interest in statistical
mechanics. Each chapter focuses on one topic. The topics are explained on the basis of
experimental results. We present their modelling together with a selection of the
theoretical methods used to study them.

This document includes: (1) A detailed schedule of the lectures, tutorials (TDs) and exam.
(2) A draft of the lecture notes on Statistical Mechanics. The notes summarize the material
that will be covered during the semester. They are certainly incomplete and may contain
errors; hopefully we shall improve them with the help of the students. (3) The TDs.

1 Programme

The TDs are designed to be done at home, after some guidance given in class. Their
solutions must be handed in and will count towards the final mark.

1. Introduction. Review of the notions of probability and statistics [1, 2, 3, 4].

Some applications: the Luria-Delbrück experiment [5]; extreme value statistics [6,
7]; a paradigm, the Ising model and its interdisciplinary applications [8, 9, 10,
11, 12].

TD 1: simple review exercises.

Reading: the article by Luria and Delbrück [5]; structures in the Universe [14].

2. Phase transitions [8, 9, 10, 11, 12]

Examples: liquid-vapour, paramagnetic-ferromagnetic, DNA. Classification
of the transitions.

Mean-field approach.

Critical phenomena, scale invariance, introduction to the renormalization
group [10, 11, 13].

TD 2: fully connected ferromagnetic Ising model with p-spin interactions:
(p = 2) second-order transition, (p > 2) first-order transition.

Reading: phase transitions, a mechanical example [15].

3. Disorder [16, 17, 18, 19, 20]

Definition of annealed and quenched disorder (examples). Their statistical
treatment.

Quenched disorder: effects of frustration, self-averaging of the free energy,
introduction to the replica method.

Examples: spin glasses, random potentials, neural networks.

TD 3: the spin chain with disordered interactions.

Reading: the fracture problem [21].

4. Stochastic processes [22, 23]

Brownian motion. The master equation, the Langevin and Fokker-Planck
approaches.

TD 4: ratchets, application to electrophoresis.

5. Growth of surfaces and interfaces [24]

Examples. Roughness. Dynamic scaling laws.

The random deposition model: solution, continuum limit and presentation of the
Edwards-Wilkinson and Kardar-Parisi-Zhang equations.

TD 5: study of the Edwards-Wilkinson equation.

2 Introduction

2.1 Background

Equilibrium Statistical Mechanics is a very well-established branch of theoretical physics.
Together with Quantum Mechanics, it forms the basis of Modern Physics.

The goal of equilibrium statistical mechanics is to derive the thermodynamic functions
of state of a macroscopic system from the microscopic laws that determine the behaviour
of its constituents. In particular, it explains the origin of thermodynamic - and intuitive
- concepts like pressure, temperature, heat, etc.

In Table 1 we recall the typical length, time and energy scales appearing in the
microscopic and macroscopic worlds.

Table 1: Typical length, energy and time scales in the microscopic and macroscopic worlds.

A reference number is Avogadro's number, $N_A = 6.02 \times 10^{23}$; it counts the number
of atoms in a mole, i.e. in 12 g of $^{12}$C, and it yields the order of magnitude of the
number of molecules at a macroscopic level. The ionization energy of the hydrogen atom is
13.6 eV and sets the energy scale in Table 1.

It is clear from the figures in Table 1 that, from a practical point of view, it would be
impossible to solve the equations of motion for each one of the $N \approx N_A$ particles
(let us keep the discussion classical; including quantum mechanical effects would not change
the main conclusions to be drawn henceforth) and derive from their solution the macroscopic
behaviour of the system. Moreover, the deterministic equations of motion may present
a very high sensitivity to the choice of the initial conditions - deterministic chaos - and
thus the precise prediction of the evolution of the ensemble of microscopic constituents
becomes unfeasible even from a more fundamental point of view.

The passage from the microscopic to the macroscopic is then done with the help of
statistical methods, Probability Theory and, in particular, the Law of Large Numbers. It
assumes - and this has been quite well confirmed - that there are no big changes in the
fundamental Laws of Nature when going through all these orders of magnitude. However, a
number of new and interesting phenomena arise due to the unexpected collective behaviour
of these many degrees of freedom. For example, phase transitions occur when an external
parameter is varied; these are not due to a change in the form of the microscopic interactions
but, rather, to the locking of the full system in special configurations.

Equilibrium statistical mechanics also makes another very important assumption that
we shall explain in more detail below: that of the equilibration of the macroscopic system.
Some very interesting systems do not satisfy this hypothesis. Still, one would like to
use probabilistic arguments to characterize their macroscopic behavior. This is possible
in a number of cases and we shall discuss some of them. Indeed, deriving a theoretical
framework to describe the behavior of macroscopic systems out of equilibrium is one of the
present major challenges in theoretical physics.

2.2 This course 

In this set of lectures we shall discuss some problems in equilibrium and dynamic statistical
mechanics that either are not fully understood or currently receive the attention of
researchers because of their application to problems of interest in physics and other areas
of science. The plan of the set of lectures is the following:

In the first Chapter we recall some aspects of Probability Theory and Statistical
Mechanics. In Appendix A we recall the definition of a probability, its main properties, and
the main probability functions encountered in the hard sciences. In the main part of the text
we show the utility of probabilistic arguments by discussing the experiment of Luria and
Delbrück - considered a founding experiment of Molecular Biology, for which the authors
received the Nobel Prize in 1969 - and some aspects of the laws of large numbers and extreme
value statistics. Basic features of the foundations of Statistical Mechanics are recalled
next. Finally, we define the Ising model and list some of its most important properties,
which make it such a standard model.

In the second Chapter we describe the theory of phase transitions: first we explain the 
mean-field approach, and then we introduce fluctuations and show how the importance 
of these led to the development of the renormalization group. 

The third Chapter is devoted to the discussion of disorder and its effects on phase
transitions and low-temperature behavior. We briefly present the three main routes to
describe these systems analytically: scaling, the replica method and functional
renormalization group ideas.

In the last two Chapters we introduce time into the discussion. First, in Chapter 4 
we define stochastic processes, the Langevin and Fokker-Planck formalism and we briefly 
discuss the dynamics of macroscopic systems close to thermal equilibrium. Next, in 
Chapter 5 we treat the problem of the random growth of a surface. We introduce some 
physical examples and two simple models, we discuss the behavior in terms of scaling laws 
and we also solve one of these models analytically while describing what fails in the other 
and the need to use improved renormalization group ideas out of equilibrium. 

It is clear that a proper explanation of all of these problems and analytical methods
would need many more hours of teaching. We shall only give the main ingredients of each
of them and provide the interested students with references to deepen their knowledge of
these subjects. The course is closely related to those of J-M di Meglio (first semester)
and O. Martin (second semester).

3 Basic notions

3.1 Applications of probability theory 
3.1.1 The Luria-Delbrück experiment

This is a very nice example of the use of probability concepts - and, more generally, of a
training in Physics! - in varied problems in Science.

This experiment provided the first proof of genetic mutation in bacteria. It works as
follows. Colonies of bacteria, typically including $10^9$ members, are exposed to a virus that
kills most of them. However, after some time, some new bacteria appear, showing that
either

(i) some bacteria acquired resistance to the virus when they were exposed to it (adaptation,
Lamarckian hypothesis), or

(ii) their ancestors already possessed resistance, via mutation, when they were exposed to
the virus.



Figure 1: A tree representing the growth of a colony of bacteria under the mutation hy- 
pothesis. Each level in the tree corresponds to a generation. Time grows going down along 
the vertical direction. The nodes represent individuals in the colonies. The individuals 
shown with a cross are the mutant ones. The individuals painted white are sensitive to 
the virus while the ones painted black are resistant. 

How can one distinguish between the two scenarios? It is clear that if the adaptive
hypothesis held true, the spatial distribution of the resistant bacteria in the sample would
be uniform and, moreover, the number of resistant bacteria would not increase with the
age of the sample. If, in contrast, the mutation hypothesis held true, then the number of
resistant bacteria should be larger for an older sample and they should appear in groups
of individuals related by descent. Luria developed an experimental analysis, based on
a smart use of statistics (see Appendix A), that allowed him to determine that
the mutation hypothesis is correct.

Luria's idea was to study finite-size sample-to-sample fluctuations to distinguish
between adaptation and mutation. Take a colony of bacteria of a given age, say it contains
$N$ individuals, and divide it into $M$ samples with $N/M$ individuals each. Now expose
each sample to the virus and count how many resistant bacteria, $n_k$ with $k = 1, \dots, M$,
are in each of them. If the adaptation hypothesis is correct, all samples react in roughly the
same way, and $n$ should be described by a Poisson distribution, characterized by an
average that coincides with the mean-square deviation,
$\langle n \rangle = \langle (n - \langle n \rangle)^2 \rangle = \mu$, with $\mu$
the parameter of the Poisson distribution. If, instead, the mutation hypothesis is correct,
some samples should have many more resistant bacteria than others. Thus, the mean-square
deviation should be much larger than the average.
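
The contrast between the two hypotheses can be illustrated with a small simulation. The sketch below is only a toy model, not the analysis of Luria and Delbrück: it assumes that each sample is an independently grown culture starting from one sensitive cell, that all cells divide synchronously, and that a sensitive daughter mutates with a hypothetical probability `MU` per division. The function names and parameter values are illustrative choices. It compares the variance-to-mean ratio of the resistant counts (equal to 1 for a Poisson law) under the two scenarios.

```python
import random
import statistics


def binomial(n, p, rng):
    """Number of successes in n independent trials of probability p."""
    return sum(1 for _ in range(n) if rng.random() < p)


def mutation_culture(generations, mu, rng):
    """Mutation hypothesis: resistance appears at random during growth and is inherited."""
    sensitive, resistant = 1, 0
    for _ in range(generations):
        daughters = 2 * sensitive                  # every sensitive cell divides
        new_mutants = binomial(daughters, mu, rng)  # some daughters mutate
        sensitive = daughters - new_mutants
        resistant = 2 * resistant + new_mutants     # resistant cells divide too
    return resistant


def adaptation_culture(size, p_resist, rng):
    """Adaptation hypothesis: each cell independently becomes resistant on exposure."""
    return binomial(size, p_resist, rng)


def fano(counts):
    """Variance-to-mean ratio of the counts; close to 1 for a Poisson distribution."""
    return statistics.pvariance(counts) / statistics.mean(counts)


# Illustrative (assumed) parameters: 300 cultures of 2**10 cells each.
GENERATIONS, MU, CULTURES = 10, 2e-3, 300
rng = random.Random(0)

mutation_counts = [mutation_culture(GENERATIONS, MU, rng) for _ in range(CULTURES)]

# Choose the adaptation probability so that both scenarios have the same mean count.
size = 2 ** GENERATIONS
p_match = statistics.mean(mutation_counts) / size
adaptation_counts = [adaptation_culture(size, p_match, rng) for _ in range(CULTURES)]

print(f"adaptation: mean = {statistics.mean(adaptation_counts):.2f}, "
      f"variance/mean = {fano(adaptation_counts):.2f}")
print(f"mutation:   mean = {statistics.mean(mutation_counts):.2f}, "
      f"variance/mean = {fano(mutation_counts):.2f}")
```

In this toy model the adaptation scenario gives a ratio close to one, while the mutation scenario gives a much larger ratio because rare early mutations produce large resistant clones.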

Let us discuss the theory behind Luria's argument in a bit more detail. Recall
the usual calculation of the density fluctuations in a gas. Let $N$ and $V$ be the number of
particles and the volume of a gas, and let $n$ be the number of particles in a very small
volume $v$ within the bulk, $v \ll V$. Since the gas is uniform, the probability that any
given particle is in $v$ is just $v/V$, and the probability that $n$ given particles are in
$v$ is $(v/V)^n$. Similarly, the probability that a particle is not in $v$ is $1 - v/V$, and
the probability that $N - n$ given particles are not in $v$ is $(1 - v/V)^{N-n}$. The
probability of having any $n$ particles in $v$ is given by the binomial formula:

$$ P(n, v; N, V) = \frac{N!}{n! \, (N-n)!} \left( \frac{v}{V} \right)^{n} \left( 1 - \frac{v}{V} \right)^{N-n} . \tag{3.1} $$

The prefactor gives the number of ways of choosing $n$ particles from the total number $N$.
This expression simplifies and becomes Poisson's law in the limits $v \ll V$ and $N \to \infty$,

$$ p(n) = \frac{\langle n \rangle^{n}}{n!} \, e^{-\langle n \rangle} . \tag{3.2} $$

(In the third factor one replaces the exponent $N - n \simeq N$; the first factor is
approximated by using Stirling's formula for the factorials, see Appendix [?]; and then
$\langle n \rangle = N v / V$, as one can also verify by computing the average directly from
this expression.) Note that $n$ can be very different from its mean value, $\langle n \rangle$,
but it is still much smaller than $N$, which has been taken to infinity in this calculation.
From this expression one can compute the average $\langle n^2 \rangle$ and the mean-square
fluctuations of the number of particles,

$$ \sigma^2 = \langle n^2 \rangle - \langle n \rangle^2 = \langle n \rangle . \tag{3.3} $$

This is the main characteristic of the Poisson law exploited by Luria.
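
The limit leading from (3.1) to (3.2), and the property (3.3), can be checked numerically. The following is a minimal sketch; the values of `N` and of the ratio `p` $= v/V$ are arbitrary illustrative choices, and the sums are truncated at $n = 30$, where the Poisson weights are negligible for the chosen mean.

```python
import math


def binomial_pmf(n, N, p):
    """Eq. (3.1) with p = v/V: probability of finding n of the N particles in v."""
    return math.comb(N, n) * p**n * (1.0 - p) ** (N - n)


def poisson_pmf(n, mean):
    """Eq. (3.2): Poisson law with average <n> = mean."""
    return mean**n * math.exp(-mean) / math.factorial(n)


# Assumed illustrative values: many particles, tiny sub-volume, <n> = N v / V = 3.
N = 10**6
p = 3.0e-6
mean = N * p
n_values = range(31)  # the Poisson weights beyond n = 30 are negligible here

# Largest pointwise difference between the binomial and Poisson laws.
max_diff = max(abs(binomial_pmf(n, N, p) - poisson_pmf(n, mean)) for n in n_values)

# First two moments of the Poisson law, to check eq. (3.3).
first_moment = sum(n * poisson_pmf(n, mean) for n in n_values)
second_moment = sum(n * n * poisson_pmf(n, mean) for n in n_values)
variance = second_moment - first_moment**2

print(f"max |binomial - Poisson| = {max_diff:.2e}")
print(f"<n> = {first_moment:.6f}, sigma^2 = {variance:.6f}")
```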

Luria carried out these experiments and found the desired result: the sample-to-sample
fluctuations were much larger than predicted from a Poisson distribution, and
hence he concluded that the mutation hypothesis was correct. In collaboration with
Delbrück he refined the analysis and even found a way to estimate the mutation rate
from their experimental observations. They received the Nobel Prize in Physiology or
Medicine in 1969.



3.1.2 Statistics of extremes 

The statistics of extreme values appears in a number of interesting problems
in the hard and applied sciences. In all sorts of applications, accurate risk assessment
relies on the effective evaluation of the extremal behavior of the process under study. Unlike
most of Statistics, which tries to say something about typical behavior, Extreme Value
Statistics attempts to characterize unlikely behavior, or at least to say how unlikely the
behavior is. Applications include: flood risk assessment; financial risk management;
insurance assessment; setting industrial safety standards; the prediction of extreme weather
conditions. In the context of our lectures, we shall see the statistics of extremes appearing
in phase transitions and diffusion processes.

The mathematical question is the following. Let us study a sequence $s_N = x_1, \dots, x_N$
of realizations of a random variable $x$ with a probability distribution function $p(x)$. Each
entry $x_i$ is an independent identically distributed random number. A natural question to
ask is what is the maximum value taken by $x$, i.e. what is
$x_{\max} = \max(x_1, x_2, \dots, x_N)$? It is clear that $x_{\max}$ is itself a random
variable (since if we drew different sequences $s_N$ we would obtain different values of
$x_{\max}$). We then need to characterize $x_{\max}$ in probability, and the quantity we need
to determine is its probability distribution function, $q(x_{\max})$.

This kind of question was raised in the context of studies funded by insurance companies
in the Netherlands, where it was useful to know how high a rise in the sea level could be.
Emil Gumbel developed part of the theory of extreme value statistics. Quoting
him: "It seems that the rivers know the theory. It only remains to convince the engineers of
the validity of this analysis." or also "Il est impossible que l'improbable n'arrive jamais."
("It is impossible for the improbable never to happen.")
With the computational advances and software developed in recent years, the application
of the statistical theory of extreme values to weather and climate has become relatively
straightforward. Annual and diurnal cycles, trends (e.g., reflecting climate change), and
physically-based covariates (e.g., El Niño events) can all be incorporated in a
straightforward manner.

Let us call $f(\Lambda)$ the cumulative probability,

$$ f(\Lambda) = \int_{-\infty}^{\Lambda} dx \, p(x) , \tag{3.4} $$

i.e. the probability of $x$ being smaller than $\Lambda$, and $g(\Lambda)$ the cumulative
probability of $x_{\max}$,

$$ g(\Lambda) = \int_{-\infty}^{\Lambda} dx_{\max} \, q(x_{\max}) . \tag{3.5} $$

Now, in order to have $x_{\max} < \Lambda$ one needs to have all $x$'s smaller than
$\Lambda$. Thus,

$$ g(\Lambda) = [f(\Lambda)]^{N} . \tag{3.6} $$

Let us call $h(\Lambda)$ the probability that $x$ is larger than $\Lambda$,

$$ h(\Lambda) = 1 - f(\Lambda) . \tag{3.7} $$

If $\Lambda$ is a large number, $h(\Lambda)$ is expected to be small, and

$$ g(\Lambda) = [1 - h(\Lambda)]^{N} \simeq 1 - N h(\Lambda) \simeq e^{-N h(\Lambda)} . \tag{3.8} $$

One needs to evaluate this expression for different functions $h(\Lambda)$. It turns out that
for all $p(x)$ that are not bounded and that fall off to zero faster than exponentially one has
the Gumbel distribution of the maximum

$$ q(x_{\max}) = b \, e^{-b (x_{\max} - s)} \, e^{-e^{-b (x_{\max} - s)}} . \tag{3.9} $$

The two parameters $b$ and $s$ fix the mean and the variance of $q$ and depend on those
of $p(x)$. Note that a better representation is obtained using the reduced variable
$b(x_{\max} - s)$ and a logarithmic scale.
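
Equations (3.6)-(3.9) can be tested on a case where everything is explicit. The sketch below assumes, purely for illustration, the exponential density $p(x) = e^{-x}$ on $x > 0$ (a borderline case that also leads to the Gumbel law): then $h(\Lambda) = e^{-\Lambda}$ and eq. (3.8) gives $g(\Lambda) \simeq \exp(-e^{-(\Lambda - \ln N)})$, i.e. $b = 1$ and $s = \ln N$. The sequence length and the number of sequences are arbitrary choices.

```python
import math
import random


def sample_maxima(n_values, n_sequences, rng):
    """Draw n_sequences sequences of n_values exponential numbers; keep each maximum."""
    return [
        max(rng.expovariate(1.0) for _ in range(n_values))
        for _ in range(n_sequences)
    ]


def empirical_cdf(samples, lam):
    """Fraction of the samples that are smaller than lam."""
    return sum(x < lam for x in samples) / len(samples)


def gumbel_cdf(lam, b, s):
    """Cumulative Gumbel law, g(Lambda) = exp(-exp(-b (Lambda - s)))."""
    return math.exp(-math.exp(-b * (lam - s)))


# Assumed illustrative sizes: N numbers per sequence, SEQUENCES independent sequences.
N, SEQUENCES = 500, 2000
rng = random.Random(1)
maxima = sample_maxima(N, SEQUENCES, rng)

b, s = 1.0, math.log(N)  # Gumbel parameters for p(x) = exp(-x)
for lam in (s - 1.0, s, s + 1.0, s + 2.0):
    exact = (1.0 - math.exp(-lam)) ** N  # eq. (3.6) for the exponential density
    print(f"Lambda = {lam:5.2f}: empirical = {empirical_cdf(maxima, lam):.3f}, "
          f"f^N = {exact:.3f}, Gumbel = {gumbel_cdf(lam, b, s):.3f}")
```

The three columns should agree within the sampling error, which is of the order of a few percent for this number of sequences.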
Actually, one can also derive the pdf of the $a$-th largest value in the sequence, $a = 1$
being the maximum, $a = 2$ the next one, and so on, to find

$$ q(x_a) = \mathcal{N} \, e^{-a b (x_a - s)} \, e^{-e^{-b (x_a - s)}} , \tag{3.10} $$

with $\mathcal{N}$ the normalization constant that depends on $a$, $b$ and $s$.

Other $p(x)$ with different decaying forms at infinity (slower than exponential) fall into
different classes of extreme value statistics (Fréchet, Weibull).

We shall see extreme value statistics play a role in the context of critical phenomena
(Chapter 2) and surface growth (Chapter 5).

3.2 Elements of statistical mechanics

Let us here recall some important features of Statistical Mechanics [1, 2, 3, 4].

The state of a classical system made of $i = 1, \dots, N$ particles living in $d$-dimensional
space is fully characterized by a point in the $2dN$-dimensional phase space $\Gamma$ made of
the coordinates, $q$, and momenta, $p$, of the particles; for $d = 3$,
$(q, p) \equiv (q_1^1, q_1^2, q_1^3, q_2^1, q_2^2, q_2^3, \dots, q_N^1, q_N^2, q_N^3,
p_1^1, p_1^2, p_1^3, p_2^1, p_2^2, p_2^3, \dots, p_N^1, p_N^2, p_N^3)$.
The Hamiltonian of the system is $H(q, p)$ and the
time evolution is determined by Hamilton's equations of motion (equivalent to Newtonian
dynamics, of course). As time passes the representative point $(q(t), p(t))$ traces a path
in $\Gamma$. Energy, $E$, is conserved if the Hamiltonian does not depend on time explicitly
and thus all points in any trajectory lie on a constant energy surface, $H = E$.

Liouville's theorem states that a volume element in phase space does not change in the
course of time if each point in it follows the microscopic Hamilton laws of motion. This is
easy to show by computing the Jacobian of the change of variables corresponding
to the infinitesimal evolution of a little volume $d\Gamma$ dictated by the equations of motion.

If the macroscopic state of the system is characterized by the number of particles $N$,
the volume $V$ and the energy $E$ that, say, lies between $E$ and $E + dE$, all microstates,
i.e. all configurations on the constant energy surface, are equivalent. We can think about
all these microstates as being (many) independent copies of the original system. This is
Gibbs' point of view; he introduced the notion of ensemble as the collection of mental
copies of a system in identical macroscopic conditions. Different ensembles correspond
to different choices of the parameters characterizing the macroscopic state: $(N, V, E)$ in
the microcanonical, $(N, V, T)$ in the canonical and $(\mu, V, T)$ in the macrocanonical
(grand-canonical), with $T$ the temperature and $\mu$ the chemical potential.

But how can one describe the evolution of the system in phase space? In practice,
given a macroscopic system with $N \gg 1$, one cannot determine the positions and momenta
of all particles with great precision - uncertainty in the initial conditions, deterministic
chaos, etc. A probabilistic element enters into play since what one can do is estimate the
probability that the representative point of the system is in a given region of $\Gamma$.
Indeed, one introduces a time-dependent probability density $\rho(q, p; t)$ such that
$\rho(q, p; t) \, d\Gamma$ is the probability that the representative point is in a region of
volume $d\Gamma$ around the point $(q, p)$ at time $t$. Probability behaves like an
incompressible fluid in phase space. The deterministic equations of motion for $(q, p)$ allow
us to derive the deterministic Liouville equation for the evolution of $\rho$:

$$ \frac{\partial \rho}{\partial t} = - \frac{\partial H}{\partial p_i^a} \frac{\partial \rho}{\partial q_i^a} + \frac{\partial H}{\partial q_i^a} \frac{\partial \rho}{\partial p_i^a} , \tag{3.11} $$

with the summation convention over repeated indices ($i$ labels particles and $a$ labels
coordinates).

Note that if initially one knows the state of the system with great precision, the initial
$\rho$ will be concentrated in some region of phase space. At later times, $\rho$ can still be
localized - perhaps in a different region of phase space - or it can spread. This depends on
the system. Following Gibbs, the probability density $\rho$ is interpreted as the one obtained
within the (microcanonical) ensemble.

It is important to note that Liouville's equation remains invariant under time-reversal,
$t \to -t$ and $p \to -p$. Thus, for generic initial conditions its solutions oscillate in time
and do not approach a single asymptotic stationary solution that could be identified with
equilibrium. The problem of how to obtain irreversible decay from Liouville's equation
is a fundamental one in Statistical Mechanics. We shall come back to this problem in
Chapter 4. Now, let us mention an attempt to understand the origin of irreversibility in
terms of flows in phase space, namely, ergodic theory, founded by Boltzmann at the end
of the 19th century [3].

In the absence of a good way to determine the evolution of $\rho$ and its approach to a
stationary state, we can simply look for stationary solutions of eq. (3.11), i.e. $\rho$ such
that the right-hand side vanishes. The simplest such solution is given by a $\rho$ that depends
on the energy $E$ only, $\rho(E)$. Even if it is very difficult to show, this solution is very
smooth as a function of $(q, p)$ and it is then the best candidate to describe the equilibrium
state - understood as the one that corresponds to the intuitive knowledge of equilibrium
in thermodynamics. In short, one postulates that all points on the energy surface $E$ are
equally likely - there is a priori no reason why some should be more probable than others! -
and one proposes the microcanonical measure:

$$ \rho(E) = \begin{cases} \rho_0 & \text{if } H \in (E, E + dE) , \\ 0 & \text{otherwise} , \end{cases} \tag{3.12} $$

with $\rho_0$ a constant fixed by normalization. One then builds all the Statistical Mechanics
machinery on it, constructing the other ensembles, showing that thermodynamics is recovered,
and so on.

Finally, let us discuss Boltzmann's and Gibbs' interpretations of averages and the ergodic
hypothesis. Boltzmann interpreted macroscopic observations as time averages of the form

$$ \overline{A} \equiv \lim_{\tau \to \infty} \frac{1}{\tau} \int_{t_0}^{t_0 + \tau} dt \, A(q(t), p(t)) . \tag{3.13} $$

However, in practice, these averages are impossible to calculate. With the introduction
of the concept of ensembles Gibbs gave a different interpretation of (and an actual way of
computing) macroscopic observations. For Gibbs, these are statistical averages
over all elements of the statistical ensemble,

$$ \langle A \rangle = c \int \prod_{i=1}^{N} \prod_{a=1}^{d} dq_i^a \, dp_i^a \; \rho(q, p) \, A(q, p) , \tag{3.14} $$

with $\rho$ the measure. In the microcanonical ensemble this is an average over microstates
on the constant energy surface taken with the microcanonical distribution (3.12):

$$ \langle A \rangle = c \int \prod_{i=1}^{N} \prod_{a=1}^{d} dq_i^a \, dp_i^a \; \delta(H(q, p) - E) \, A(q, p) , \tag{3.15} $$

and the normalization constant is
$c^{-1} = \int \prod_{i=1}^{N} \prod_{a=1}^{d} dq_i^a \, dp_i^a \; \delta(H(q, p) - E)$.
In the canonical ensemble the average is computed with the Gibbs-Boltzmann weight:

$$ \langle A \rangle = Z^{-1} \int \prod_{i=1}^{N} \prod_{a=1}^{d} dq_i^a \, dp_i^a \; e^{-\beta H(q, p)} \, A(q, p) . \tag{3.16} $$

$Z$ is the partition function,
$Z = \int \prod_{i=1}^{N} \prod_{a=1}^{d} dq_i^a \, dp_i^a \; e^{-\beta H(q, p)}$.

The (weak) ergodic hypothesis states that under the dynamic evolution the representative
point in phase space of a classical system governed by Newton's laws can get as close
as desired to any point on the constant energy surface.

The ergodic hypothesis states that time and ensemble averages, (3.13) and (3.14),
coincide in equilibrium for all reasonable observables. This hypothesis cannot be proven in
general but it has been verified in a large number of cases. In general, the great success
of Statistical Mechanics in predicting quantitative results has given enough evidence to
accept this hypothesis.
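
A case in which the equality of time and ensemble averages can be checked by hand is the one-dimensional harmonic oscillator. The sketch below is only an illustration under stated assumptions: it takes $H = p^2/2 + q^2/2$ (unit mass and frequency, a choice made here for simplicity), uses the exact trajectory instead of a numerical integrator, and samples the energy surface uniformly in the angle along the orbit, which is the measure $\delta(H - E) \, dq \, dp$ for this system. The observable $A = q^2$ and all function names are illustrative.

```python
import math


def time_average(observable, energy, phase, tau, dt=0.01):
    """Time average of observable(q, p) along the exact trajectory, over a window tau."""
    steps = round(tau / dt)
    amplitude = math.sqrt(2.0 * energy)
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt  # midpoint rule for the time integral
        q = amplitude * math.cos(t + phase)
        p = -amplitude * math.sin(t + phase)
        total += observable(q, p)
    return total / steps


def ensemble_average(observable, energy, n_points=10000):
    """Microcanonical average: uniform weight along the constant-energy circle."""
    amplitude = math.sqrt(2.0 * energy)
    total = 0.0
    for k in range(n_points):
        theta = 2.0 * math.pi * (k + 0.5) / n_points
        total += observable(amplitude * math.cos(theta), -amplitude * math.sin(theta))
    return total / n_points


def q_squared(q, p):
    """Illustrative observable A(q, p) = q^2."""
    return q * q


E = 1.5  # assumed energy of the oscillator
a_time = time_average(q_squared, E, phase=0.3, tau=1000.0)
a_ensemble = ensemble_average(q_squared, E)
print(f"time average = {a_time:.4f}, ensemble average = {a_ensemble:.4f}")
```

Both averages equal $E$ up to a correction of order $E/\tau$ in the time average, which vanishes as the time window grows.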

An important activity in modern Statistical Mechanics is devoted to the study of 
macroscopic systems that do not satisfy the ergodic hypothesis. A well-understood case 
is that of phase transitions, which we shall discuss in the next section. Other cases
are related to a failure to equilibrate. This can occur either because the systems are
externally driven or because they start from an initial condition that is far from equilibrium
and their interactions are such that they do not manage to equilibrate. One may
wonder whether certain concepts of thermodynamics and equilibrium statistical mechan- 
ics can still be applied to the latter problems. At least for cases in which the macroscopic 
dynamics is slow one can hope to derive an extension of equilibrium statistical mechanics 
concepts to describe their behavior. 

Finally, let us remark that it is usually much easier to work in the canonical ensemble 
both experimentally and analytically. Thus, in all our future applications we assume that 
the system is in contact with a heat reservoir with which it can exchange energy and that 
keeps temperature fixed. 

3.3 The Ising model 

The Ising model is a mathematical representation of a magnetic system. It describes
magnetic moments as classical spins, $s_i$, taking values $\pm 1$, lying on the vertices of a
cubic lattice in $d$-dimensional space, and interacting via nearest-neighbor couplings,
$J > 0$. The energy is then

$$ E = - J \sum_{\langle ij \rangle} s_i s_j - h \sum_i s_i , \tag{3.17} $$

where $h$ is an external magnetic field.
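
As a concrete reading of eq. (3.17), the following minimal sketch evaluates the energy of a spin configuration. It assumes, for illustration only, a square lattice in $d = 2$ with periodic boundary conditions and linear size $L \geq 3$; the function name `ising_energy` and the list-of-lists representation are choices made here.

```python
def ising_energy(spins, J=1.0, h=0.0):
    """Energy (3.17) of an L x L configuration with periodic boundaries (L >= 3)."""
    L = len(spins)
    bond_sum = 0
    field_sum = 0
    for x in range(L):
        for y in range(L):
            s = spins[x][y]
            # Count each nearest-neighbor bond once: right and down neighbors only.
            bond_sum += s * (spins[(x + 1) % L][y] + spins[x][(y + 1) % L])
            field_sum += s
    return -J * bond_sum - h * field_sum


L = 4
all_up = [[1] * L for _ in range(L)]
checkerboard = [[1 if (x + y) % 2 == 0 else -1 for y in range(L)] for x in range(L)]

# All spins up: 2 L^2 satisfied bonds and total magnetization L^2.
print(ising_energy(all_up, J=1.0, h=0.5))
# Checkerboard: every bond is unsatisfied and the magnetization vanishes.
print(ising_energy(checkerboard, J=1.0, h=0.5))
```

With $J > 0$ the aligned configuration has the lowest energy, which is the origin of the ferromagnetic order discussed in the text.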

The Ising model is especially attractive for a number of reasons:

(i) It is probably the simplest example of modeling that a student encounters.

(ii) It can be solved in some cases: $d = 1$ (Chapter 2), $d = 2$, $d \to \infty$ (Chapter 2).
The solutions have been the source of new and powerful techniques later applied to a variety
of different problems in physics and interdisciplinary fields.

(iii) It has not been solved analytically in the most natural case, $d = 3$!

(iv) It has a phase transition, an interesting collective phenomenon, separating two phases
that are well-understood and behave, at least qualitatively, as real magnets with a
paramagnetic and a ferromagnetic phase (Chapter 2).

(v) There is an upper, $d_u$, and a lower, $d_l$, critical dimension. Above $d_u$ mean-field
theory correctly describes the critical phenomenon. Below $d_l$ there is no finite $T$ phase
transition. Below $d_u$ mean-field theory fails (Chapter 2).

(vi) One can see at work generic tools to describe the critical phenomenon like scaling
(Chapters 2, 5) and the renormalization group (Chapter 2).

(vii) Generalizations in which the interactions and/or the fields are random variables taken
from a probability distribution are typical examples of problems with quenched disorder
(Chapter 3).

(viii) Generalizations in which spins are not just Ising variables but vectors with $n$
components are also interesting: $n = 1$ (Ising), $n = 2$ (XY), $n = 3$ (Heisenberg), ... ,
$n \to \infty$ ($O(n)$).

(ix) One can add a dynamic rule to update the spins, and one is then confronted with the new
world of stochastic processes (Chapter 4).

(x) Last but not least, it has been a paradigmatic model extended to describe many 
problems going beyond physics like neural networks, social ensembles, etc. 

In the rest of this set of lectures we shall discuss the physics of this model and we
shall study its statics and dynamics with a number of analytic techniques.

4 Phase transitions

Take a piece of material in contact with an external reservoir. The material will be
characterized by certain observables: energy, magnetization, etc. To characterize macroscopic
systems it is convenient to consider densities of energy, magnetization, etc., by dividing the
macroscopic value by the number of particles (or the volume) of the system. The external
environment will be characterized by some parameters, like the temperature, magnetic
field, pressure, etc. In principle, one is able to tune the latter, and the former will be
functions of them.

Sharp changes in the behavior of macroscopic systems at critical points (or lines) in
parameter space have been observed experimentally. These correspond to phase transitions,
a non-trivial collective phenomenon appearing in the thermodynamic limit. In this Section
we shall review the main features of, and analytic approaches used to study, phase
transitions.

4.1 Order and disorder 

When one cools down a magnetic sample it undergoes a sharp change in structure, as
shown by a sharp change in its macroscopic properties, at a well-defined value of the
temperature which is called the critical temperature or the Curie temperature, $T_c$. Assuming
that this annealing process is done in equilibrium, that is to say, that at each temperature
step the system manages to equilibrate with its environment after a relatively short
transient - an assumption that is far from being true in glassy systems but that can be
safely made in this context - the two states above and below $T_c$ are equilibrium states
that can be studied with the standard Statistical Mechanics tools.

More precisely, at $T_c$ the equilibrium magnetization density changes from zero above $T_c$
to a finite value below $T_c$, see Fig. 2. The high temperature state is a disordered
paramagnet while the low temperature state is an ordered ferromagnet.

One identifies the magnetization density as the order parameter of the phase transition.
It is a macroscopic observable that vanishes above the transition and takes a continuously
varying value below $T_c$. The transition is said to be continuous since the order parameter
grows continuously from zero at $T_c$.

If one looks in more detail into the variation of the magnetization
density close to $T_c$, one realizes that the magnetic susceptibility,

$$ \chi \equiv \left. \frac{\partial m}{\partial h} \right|_{h = 0} = - \left. \frac{\partial^2 f}{\partial h^2} \right|_{h = 0} , \tag{4.1} $$

i.e. the linear variation of the magnetization density $m$ with respect to its conjugate
magnetic field $h$, diverges when approaching the transition from both sides. As the second
identity shows, the susceptibility is just a second derivative of the free-energy density $f$.
Thus, a divergence of the susceptibility indicates a non-analyticity of the free-energy
density. This can occur only in the infinite volume or thermodynamic limit, $N \to \infty$.
Otherwise the free-energy density is just the logarithm of the partition function, which is a
sum of a finite number of terms that are exponentials of analytic functions of the parameters,
and it is thus an analytic function of the external parameters itself.

Figure 2: The magnetization as a function of temperature for three magnetic compounds.

The phenomena observed near such a critical temperature are called critical phenomena. Since
the pioneering work of Curie, Langevin and others, the two phases, paramagnetic and
ferromagnetic, are well-understood. Qualitative arguments (see Sect. 4.3) as well as the
mean-field approach (see Sect. 4.5) capture the two phases and their main characteristics.
However, what happens close to the critical point remained difficult to describe
quantitatively until the development of scaling and the renormalization group (Sect. 4.7).

4.2 Discussion

Let us discuss some important concepts, pinning fields, broken ergodicity and broken
symmetry, with the help of a concrete example, the Ising model (3.17). The discussion is,
however, much more general.

4.2.1 Pinning field 

In the absence of a magnetic field, for pair interactions the energy is an even function of
the spins, $E(s) = E(-s)$ and, consequently, the equilibrium magnetization density computed
as an average over all spin configurations with their canonical weight, $e^{-\beta H}$,
vanishes at all temperatures.
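
The symmetry argument can be verified by exact enumeration on a very small system. The sketch below assumes, for illustration, a ring of $n = 10$ Ising spins with nearest-neighbor coupling $J$ (a one-dimensional system chosen only because all $2^n$ configurations can be listed); the parameter values and function names are choices made here. It computes the canonical average of the magnetization density with and without a small field $h$.

```python
import itertools
import math


def ring_energy(spins, J, h):
    """Energy (3.17) for spins on a ring (periodic chain with at least 3 sites)."""
    n = len(spins)
    bonds = sum(spins[i] * spins[(i + 1) % n] for i in range(n))
    return -J * bonds - h * sum(spins)


def mean_magnetization(n, beta, J, h):
    """Canonical average of the magnetization density by summing all 2**n configurations."""
    partition_function = 0.0
    weighted_sum = 0.0
    for spins in itertools.product((-1, 1), repeat=n):
        weight = math.exp(-beta * ring_energy(spins, J, h))
        partition_function += weight
        weighted_sum += weight * sum(spins) / n
    return weighted_sum / partition_function


# Assumed illustrative parameters: low temperature (beta J = 2) and a tiny field.
n, beta, J = 10, 2.0, 1.0
m_zero_field = mean_magnetization(n, beta, J, h=0.0)
m_small_field = mean_magnetization(n, beta, J, h=0.01)

print(f"h = 0:    <m> = {m_zero_field:.2e}")
print(f"h = 0.01: <m> = {m_small_field:.3f}")
```

At $h = 0$ the contributions of $s$ and $-s$ cancel exactly, whatever the temperature, while a tiny field already produces a finite average at low temperature in this finite system.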

At high temperatures, $m = 0$ characterizes completely the equilibrium properties of the
system since there is a unique paramagnetic state with vanishing magnetization density.
At low temperatures, instead, if we perform an experiment we do observe a net magnetization
density. In practice, when the experimenter takes the system
through the transition one cannot avoid the application of tiny external fields (the
experimental set-up, the Earth...) and there is always a small pinning field that actually
selects one of the two possible equilibrium states, with positive or negative magnetization
density, allowed by symmetry. In the course of time, the experimentalist should see the
full magnetization density reverse. However, this is not seen in practice since astronomical
time-scales would be needed. We shall see this phenomenon at work when solving
mean-field models exactly below.

4.2.2 Broken ergodicity 

Introducing dynamics into the problem$^1$, ergodicity breaking can be stated as the fact
that the temporal average over a long (but finite) time window is different from the statistical
one, with the sum running over all configurations with their associated Gibbs-Boltzmann
weight:

$$\overline{A} \neq \langle A \rangle . \tag{4.2}$$

In practice the temporal average is done in a long but finite interval $\tau < \infty$. During this
time, the system is positively or negatively magnetized depending on whether it is in one
or the other of the degenerate equilibrium states. Thus, the temporal average of the orientation
of the spins, for instance, yields a non-vanishing result $\overline{A} = m \neq 0$. If, instead, one
computes the statistical average summing over all configurations of the spins, the result is
zero, as one can see using just symmetry arguments. The reason for the discrepancy is that
with the time average we are actually summing over half of the available configurations
of the system. If the time $\tau$ is not large enough, as a function of $N$, the trajectory does not
have enough time to visit all configurations in phase space. One can reconcile the two
results by, in the statistical average, summing only over the configurations with positive
(or negative) magnetization density. We shall see this at work in a concrete calculation
below.

4.2.3 Spontaneous broken symmetry 

In the absence of an external field the Hamiltonian is symmetric with respect to $\vec s \to -\vec s$.
The phase transition corresponds to a spontaneous symmetry breaking between the states
of positive and negative magnetization. One can determine the one that is chosen when
going through $T_c$ either by applying a small pinning field that is taken to zero only after
the thermodynamic limit, or by imposing adequate boundary conditions like, for instance,
all spins pointing up on the borders of the sample. Once a system settles into one of the
equilibrium states this is completely stable in the $N \to \infty$ limit.

Ergodicity breaking necessarily accompanies spontaneous symmetry breaking but the 
reverse is not true; see [9] for an example and the discussion in Sect. 5 on disordered 
systems. Indeed, spontaneous symmetry breaking generates disjoint ergodic regions in 
phase space, related by the broken symmetry, but one cannot prove that these are the 
only ergodic components in total generality. Mean-field spin-glass models provide a coun- 
terexample of this implication. 

4.3 Energy vs entropy 

Let us first use a thermodynamic argument to describe the high and low temperature 
phases. 

The free energy of a system is given by $F = U - k_B T S$ where $U$ is the internal energy,
$U = \langle H \rangle$, and $S$ is the entropy. In the following we measure temperature in units of
$k_B$, $k_B T \to T$. The equilibrium state may depend on temperature and it is such that it
minimizes the free-energy $F$. A competition between the energetic contribution and the
entropic one may then lead to a change in phase at a definite temperature, i.e. a different
group of microconfigurations, constituting a state, with different macroscopic properties
dominates the thermodynamics on either side of the transition.

$^1$ Note that the Ising model does not have a natural dynamics associated to it. We shall see in Section 6
how a dynamic rule is attributed to the evolution of the spins.

At zero temperature the free-energy is identical to the internal energy U. In a system 
with ferromagnetic couplings between magnetic moments, the magnetic interaction is such 
that the energy is minimized when neighboring moments are parallel. Thus the preferred 
configuration is such that all moments are parallel and the system is fully ordered. 

When temperature is switched on, thermal agitation provokes the reorientation of the moments
and, consequently, misalignments. Let us then investigate the opposite, infinite temperature
case, in which the entropic term dominates and the chosen configurations are such
that entropy is maximized. This is achieved by the magnetic moments pointing in random
independent directions. For example, for a model with $N$ Ising spins, the entropy at
infinite temperature is $S \sim N \ln 2$.

When temperature decreases, disorder becomes less favorable. The existence or not of a
finite temperature phase transition depends on whether long-range order, such as the one
observed in the low-temperature phase, can remain stable with respect to fluctuations, i.e.
the reversal of some moments, induced by temperature. Up to this point, the discussion
has been general and independent of the dimension $d$.

The competition argument, made more precise, allows one to conclude that there is
no finite temperature phase transition in $d = 1$ while it suggests there is one in $d > 1$.
Take a one dimensional ferromagnetic Ising model with closed (periodic) boundary conditions (the
case of open boundary conditions can be treated in a similar way), $H = -J \sum_{i=1}^{N} s_i s_{i+1}$,
$s_{N+1} = s_1$. At zero temperature it is ordered and its internal energy is just

U = -JN (4.3) 

with N the number of links and spins. Since there are two degenerate ordered configura- 
tions the entropy is 

S = ln2 (4.4) 

The internal energy is extensive while the entropy is just a finite number. At temperature 
T the free-energy of the ordered state is then 

$$F = U - TS = -JN - T \ln 2 . \tag{4.5}$$

Adding a domain of the opposite order in the system, i.e. reversing $n$ consecutive spins, two bonds
are unsatisfied and the internal energy becomes

$$U_2 = -J(N-2) + 2J = -J(N-4) , \tag{4.6}$$

for all $n$. Since one can place the misaligned spins anywhere in the lattice, there are $N$
equivalent configurations with this internal energy. The entropy of this state is then

$$S_2 = \ln(2N) . \tag{4.7}$$

The factor of 2 inside the logarithm arises due to the fact that we consider a reversed
domain in each one of the two ordered states. At temperature $T$ the free-energy of a state
with one reversed domain and two domain walls is

$$F_2 = U_2 - T S_2 = -J(N-4) - T \ln(2N) . \tag{4.8}$$

The variation in free-energy between the ordered state and the one with one domain is

$$\Delta F = F_2 - F = 4J - T \ln N . \tag{4.9}$$

Thus, even if the internal energy increases due to the presence of the two domain walls, the
increase in entropy is such that the state with a droplet in it has a lower free-energy,
$\Delta F < 0$, at any finite temperature $T$ as soon as $N > e^{4J/T}$, and hence in the
thermodynamic limit. We conclude that spin flips are favorable and
order is destroyed at any finite temperature. The ferromagnetic Ising chain does not have
a finite temperature phase transition.
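The energy-entropy balance of eq. (4.9) is easy to explore numerically. The following is a minimal sketch, not part of the original notes; the function names `delta_F` and `crossover_size` are illustrative choices, and temperature is measured in units of $k_B$ as in the text.

```python
import math

def delta_F(J, T, N):
    """Free-energy cost of one reversed domain in a periodic chain, eq. (4.9)."""
    return 4.0 * J - T * math.log(N)

def crossover_size(J, T):
    """Chain length N* = exp(4J/T) above which Delta F becomes negative."""
    return math.exp(4.0 * J / T)

J = 1.0
for T in (0.5, 1.0, 2.0):
    n_star = crossover_size(J, T)
    # For N well above N* the domain lowers the free energy.
    print(f"T = {T}: N* = {n_star:.1f}, Delta F(10 N*) = {delta_F(J, T, 10 * n_star):.3f}")
```

The crossover size grows exponentially when $T$ decreases, but it stays finite for any $T > 0$, which is the content of the argument above.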

A similar argument in d > 1 suggests that one can have, as indeed happens, a finite 
temperature transition in these cases (see, e.g. [9]). 

At low temperatures, the structure of droplets, meaning patches in which the spins
point in the direction of the opposite state, has been studied in detail with numerical
simulations. This knowledge has been used to derive phenomenological theories, such as the
droplet model for systems with quenched disorder. At criticality one observes ordered
domains of the two equilibrium states at all length scales, with fractal properties, and
these have also been studied in detail. Right above $T_c$ finite patches of the system are
indeed ordered but these do not include a finite fraction of the spins in the sample and
the magnetization density vanishes. However, these patches are enough to generate non-trivial
thermodynamic properties very close to $T_c$ and the richness of the critical phenomena.

4.4 Classification 

Phase transitions are commonly classified by their order. The more common ones are 
those of first and second order. 

In a first order phase transition the order parameter jumps at the transition point from a
vanishing value on the disordered side to a finite value right on the ordered side of the
transition. This is related to the fact that a first derivative of the free-energy density
is discontinuous.

In a second order phase transition the order parameter is continuous at the transition,
i.e. it smoothly departs from zero at the critical point, but its variation with respect
to the conjugate field in the zero field limit, or linear susceptibility, diverges. This is a
second derivative of the free-energy density.

Higher order phase transitions appear when higher derivatives of the free-energy density
are singular, i.e. when $d^n f/dy^n$ is discontinuous the transition is of $n$-th order, where $y$ is any
argument of $f$.

In disordered systems (see Sect. 5) a mixed case occurs in which the order parameter 
is discontinuous at the transition but all first derivatives of the free-energy density are 
finite. This is called a random first order transition and it provides a scenario for the 
glassy arrest [26]. 

4.5 Mean-field theory 

In spite of their apparent simplicity, the statics of ferromagnetic Ising models has been
solved analytically only in one and two dimensions. The mean-field approximation allows
one to solve the Ising model in any spatial dimensionality. Even if the qualitative results
obtained are correct, the quantitative comparison to experimental and numerical data
shows that the approximation fails below an upper critical dimension $d_u$. It is however
very instructive to see the mean-field approximation at work.



4.5.1 The naive mean-field approximation 

The naive mean-field approximation consists in assuming that the probability density of
the system's spin configuration is factorizable in independent factors

$$P(\{s_i\}) = \prod_{i=1}^{N} p_i(s_i) \quad \text{with} \quad p_i(s_i) = \frac{1+m_i}{2} \, \delta_{s_i,1} + \frac{1-m_i}{2} \, \delta_{s_i,-1} \tag{4.10}$$

and $m_i = \langle s_i \rangle$, where the thermal average has to be interpreted in the restricted sense
described in the previous sections, i.e. taken over one ergodic component, in a way that
$m_i \neq 0$. Note that one introduces an order-parameter dependence in the probabilities.
Using this assumption one can compute the total free-energy

$$F = U - TS \tag{4.11}$$

where the average in $U = \langle H \rangle$ is taken with the factorized probability distribution (4.10) and the
entropy $S$ is given by

$$S = - \sum_{\{s_i\}} P(\{s_i\}) \ln P(\{s_i\}) . \tag{4.12}$$

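One can use this approximation to treat finite dimensional models. Let us apply it to the
$d$-dimensional pure ferromagnetic Ising model with nearest-neighbor interactions on a cubic
lattice, $J_{ij} = J/2$ for nearest-neighbors and zero otherwise. One finds the internal energy

$$U = -\frac{J}{2} \sum_{\langle ij \rangle} \langle s_i s_j \rangle - h \sum_i \langle s_i \rangle = -\frac{J}{2} \sum_{\langle ij \rangle} m_i m_j - h \sum_i m_i , \tag{4.13}$$

where the sum over $\langle ij \rangle$ counts each nearest-neighbor bond twice, and the entropy

$$\begin{aligned}
S &= - \sum_{\{s_i = \pm 1\}} \prod_{k=1}^{N} p_k(s_k) \, \ln \prod_{l=1}^{N} p_l(s_l) \\
  &= - \sum_{l=1}^{N} \sum_{s_l = \pm 1} p_l(s_l) \ln p_l(s_l) \\
  &= - \sum_{i} \left[ \frac{1+m_i}{2} \ln \frac{1+m_i}{2} + \frac{1-m_i}{2} \ln \frac{1-m_i}{2} \right] .
\end{aligned} \tag{4.14}$$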

For a uniformly applied magnetic field, all local magnetizations equal the total magnetization
density, $m_i = m$, and one has the 'order-parameter dependent' free-energy density:

$$f(m) = -dJ m^2 - hm + T \left[ \frac{1+m}{2} \ln \frac{1+m}{2} + \frac{1-m}{2} \ln \frac{1-m}{2} \right] . \tag{4.15}$$

The extrema, $df(m)/dm = 0$, are given by

$$m = \tanh\left( \beta 2dJ m + \beta h \right) , \tag{4.16}$$

with the factor $2d$ coming from the connectivity of the cubic lattice. This equation predicts
a second order phase transition at $T_c = 2dJ$ when $h = 0$. This result is qualitatively
correct in the sense that $T_c$ increases with increasing $d$ but the actual value is incorrect
in all finite dimensions. In particular, this treatment predicts a finite $T_c$ in $d = 1$ which
is clearly wrong. The critical behavior is also incorrect below the upper critical dimension
(see Sect. 4.5.3), with exponents
(see Sect. 4.6) that do not depend on dimensionality and take the mean-field values.
Still, the qualitative nature of the paramagnetic-ferromagnetic transition in $d > 1$ is
correctly captured. We postpone the study of the solutions to eq. (4.16) to the next
Subsection where we shall treat a similar and, in some respects, more general case. Having
an expression for the free-energy density as a function of the order parameter, which is
determined by eq. (4.16), one can compute all observables and, in particular, their critical
behavior. We shall discuss it below.
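The self-consistency equation (4.16) has no closed-form solution but it can be solved by simple iteration. The sketch below is only an illustration under stated assumptions: the function name `mean_field_magnetization`, the starting point `m0` and the tolerance are arbitrary choices, and fixed-point iteration is just one of several possible root-finding strategies (it becomes slow very close to $T_c$).

```python
import math

def mean_field_magnetization(T, d=3, J=1.0, h=0.0, m0=0.9, tol=1e-12, max_iter=100_000):
    """Iterate m -> tanh((2 d J m + h) / T), eq. (4.16), until it stops changing."""
    m = m0
    for _ in range(max_iter):
        m_new = math.tanh((2 * d * J * m + h) / T)
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    return m  # not converged within max_iter; returned as a best estimate

d, J = 3, 1.0
T_c = 2 * d * J  # mean-field critical temperature
for ratio in (0.5, 0.9, 1.1):
    m = mean_field_magnetization(ratio * T_c, d=d, J=J)
    print(f"T/T_c = {ratio}: m = {m:.4f}")
```

Starting from a positive `m0` selects the positively magnetized solution below $T_c$, which plays the same role as an infinitesimal positive pinning field.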



4.5.2 The fully-connected Ising ferromagnet 

Let us now solve exactly the fully-connected Ising ferromagnet with interactions between
all $p$-uplets of spins in an external field:

$$H = - \sum_{i_1 \neq \dots \neq i_p} J_{i_1 \dots i_p} \, s_{i_1} \dots s_{i_p} - \sum_i h_i s_i , \tag{4.17}$$

$s_i = \pm 1$, $i = 1, \dots, N$. For the sake of generality, and to include the disordered models
to be discussed in Sect. 5, we use a generic interaction strength $J_{i_1 \dots i_p}$. The ferromagnetic
model corresponds to

$$J_{i_1 \dots i_p} = \frac{J}{p! \, N^{p-1}} , \tag{4.18}$$

with $0 < J = O(1)$, i.e. finite, and $p$ a fixed integer parameter, $p = 2$ or $p = 3$ or ..., that
defines the model. The normalization with $N^{p-1}$ of the first term ensures an extensive
energy in the ferromagnetic state at low temperatures, and thus a sensible thermodynamic
limit. The factor $p!$ is chosen for later convenience. This model is a source of inspiration
for more elaborate ones with dilution and/or disorder (see Sect. 5). Using the factorization
of the joint probability density that defines the mean-field approximation, one finds

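$$F(\{m_i\}) = - \sum_{i_1 \neq \dots \neq i_p} J_{i_1 \dots i_p} \, m_{i_1} \dots m_{i_p} - \sum_i h_i m_i + T \sum_{i=1}^{N} \left[ \frac{1+m_i}{2} \ln \frac{1+m_i}{2} + \frac{1-m_i}{2} \ln \frac{1-m_i}{2} \right] . \tag{4.19}$$

The local magnetizations, $m_i$, are then determined by requiring that they minimize the
free-energy density, $\partial f(\{m_j\})/\partial m_i = 0$ with a positive definite Hessian, $\partial^2 f(\{m_j\}) / \partial m_i \partial m_j$
(i.e. with all eigenvalues being positive at the extremal value). This yields

$$m_i = \tanh\left( p \beta \sum_{i_2 \neq \dots \neq i_p} J_{i i_2 \dots i_p} \, m_{i_2} \dots m_{i_p} + \beta h_i \right) . \tag{4.20}$$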



If $J_{i_1 \dots i_p} = J/(p! \, N^{p-1})$ for all $p$-uplets and the applied field is uniform, $h_i = h$, one can take
$m_i = m$ for all $i$ and these expressions become (4.22) and (4.24) below, respectively. The
mean-field approximation is exact for the fully-connected pure Ising ferromagnet. [Note
that the fully-connected limit of the model with pair interactions ($p = 2$) is correctly
attained by taking $J \to J/N$ and $2d \to N$ in (4.16) leading to $T_c = J$.]

Figure 3: The free-energy density $f(m)$ of the $p = 2$ (left), $p = 3$ (center) and $p = 4$
(right) models at three values of the temperature $T < T_c$ (light dashed line), $T = T_c$
(dark dashed line) and $T > T_c$ (solid line) and with no applied field.

Let us solve the ferromagnetic model exactly. The sum over spin configurations in the
partition function can be traded for a sum over the variable $x = N^{-1} \sum_{i=1}^{N} s_i$, that takes
values $x = -1, -1 + 2/N, -1 + 4/N, \dots, 1 - 4/N, 1 - 2/N, 1$. Neglecting subdominant
terms in $N$, one then writes

$$Z = \sum_x e^{-\beta N f(x)} \tag{4.21}$$

with the order-parameter dependent free-energy density

$$f(x) = -\frac{J}{p!} x^p - hx + T \left[ \frac{1+x}{2} \ln \frac{1+x}{2} + \frac{1-x}{2} \ln \frac{1-x}{2} \right] . \tag{4.22}$$

The first two terms are the energetic contribution while the third one is of entropic origin
since $N!/[(N(1+x)/2)! \, (N(1-x)/2)!]$ spin configurations have the same magnetization
density.
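The entropic term in eq. (4.22) is the large-$N$ (Stirling) limit of the logarithm of this binomial multiplicity. A small numerical check, given here as a sketch with illustrative function names that are not part of the notes, makes the "neglecting subdominant terms in $N$" statement concrete.

```python
import math

def log_multiplicity_per_spin(N, n_up):
    """(1/N) ln of the number of configurations with n_up spins up (exact binomial)."""
    log_binom = math.lgamma(N + 1) - math.lgamma(n_up + 1) - math.lgamma(N - n_up + 1)
    return log_binom / N

def entropy_density(x):
    """Entropy per spin at magnetization density x; it enters eq. (4.22) multiplied by -T."""
    p_up, p_down = (1 + x) / 2, (1 - x) / 2
    return -(p_up * math.log(p_up) + p_down * math.log(p_down))

x = 0.5
for N in (100, 10_000, 1_000_000):
    n_up = round(N * (1 + x) / 2)  # number of up spins giving magnetization density x
    print(N, log_multiplicity_per_spin(N, n_up), entropy_density(x))
```

The difference between the two columns decreases with $N$, as expected for a correction of order $\ln N / N$.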

In the large $N$ limit, the partition function can be evaluated using the saddle-point
method (see Appendix 3)

$$Z \approx \sum_{\alpha} e^{-\beta N f(m_{sp}^{\alpha})} , \tag{4.23}$$

where $m_{sp}^{\alpha}$ are the absolute minima of $f(x)$ given by the solutions to $df(x)/dx|_{m_{sp}} = 0$,

$$m_{sp} = \tanh\left( \frac{\beta J}{(p-1)!} \, m_{sp}^{p-1} + \beta h \right) , \tag{4.24}$$

together with the conditions $d^2 f(x)/dx^2|_{m_{sp}^{\alpha}} > 0$.
High temperature 

In a finite magnetic field, eqn. (4.24) has a unique positive - negative - solution for 
positive - negative - h at all temperatures. The model is ferromagnetic at all temperatures 
and there is no phase transition in this parameter. 

2nd order transition for p = 2
In the absence of a magnetic field this model has a paramagnetic-ferromagnetic phase
transition at a finite $T_c$. The order of the phase transition depends on the value of $p$. This
can be seen from the temperature dependence of the free-energy density (4.22). Figure 3
displays $f(x)$ in the absence of a magnetic field at three values of $T$ for the $p = 2$ (left),
$p = 3$ (center) and $p = 4$ (right) models (we call the independent variable $m$ since the
stationary points of $f(x)$ are located at the magnetization density of the equilibrium and
metastable states, see below). At high temperature the unique minimum is $m = 0$ in all
cases. For $p = 2$, when one reaches $T_c$, the $m = 0$ minimum splits in two that slowly
separate and move towards higher values of $|m|$ when $T$ decreases until reaching $|m| = 1$
at $T = 0$ (see Fig. 3-left). The transition occurs at $T_c = J$ as can be easily seen from a
graphical solution to eqn. (4.24), see Fig. 4-left. Close to but below $T_c$, the magnetization
increases as $m \sim (T_c - T)^{1/2}$. The linear magnetic susceptibility has the usual Curie
behavior at very high temperature, $\chi \approx \beta$, and it diverges as $\chi \sim |T - T_c|^{-1}$ on both
sides of the critical point. The order parameter is continuous at $T_c$ and the transition is
of second-order thermodynamically.
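The square-root growth of the magnetization can be recovered with a short expansion. The following is a sketch, assuming $h = 0$, $p = 2$ and $|m_{sp}| \ll 1$, and using only eq. (4.24) with $T_c = J$:

```latex
\begin{aligned}
m_{sp} &= \tanh(\beta J m_{sp}) \simeq \beta J m_{sp} - \tfrac{1}{3} (\beta J m_{sp})^3 \\
\Rightarrow \quad m_{sp}^2 &\simeq \frac{3 (\beta J - 1)}{(\beta J)^3}
  = 3 \, \frac{T^2 \, (T_c - T)}{T_c^3}
  \simeq 3 \, \frac{T_c - T}{T_c} , \qquad T \to T_c^- .
\end{aligned}
```

The non-vanishing solution exists only for $T < T_c$ and behaves as $m_{sp} \simeq \pm \sqrt{3} \, (1 - T/T_c)^{1/2}$, in agreement with the exponent $1/2$ quoted above.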

1st order transition for p > 2 

For $p > 2$ the situation changes. For even values of $p$, at $T^*$ two minima (and two
maxima) at $|m| \neq 0$ appear. These coexist as metastable states with the stable minimum
at $m = 0$ until a temperature $T_c$ at which the three free-energy densities coincide, see
Fig. 3-right. Below $T_c$ the $m = 0$ minimum continues to exist but the $|m| \neq 0$ ones are
favored since they have a lower free-energy density. For odd values of $p$ the free-energy
density is not symmetric with respect to $m = 0$. A single minimum at $m^* > 0$ appears at
$T^*$ and at $T_c$ it reaches the free-energy density of the paramagnetic one, $f(m^*) = f(0)$,
see Fig. 3-center. Below $T_c$ the equilibrium state is the ferromagnetic minimum. For all
$p > 2$ the order parameter is discontinuous at $T_c$, it jumps from zero at $T_c^+$ to a finite
value at $T_c^-$. The linear magnetic susceptibility also jumps at $T_c$. While it diverges as
$(T - T_c)^{-1}$ on the paramagnetic side, it takes a finite value given by eqn. (4.26) evaluated
at $m^*$ on the ferromagnetic one. In consequence, the transition is of first-order.
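The jump of the order parameter can be seen by minimizing the zero-field free-energy density (4.22) numerically. The sketch below is an illustration only: it uses a brute-force grid search (an arbitrary choice, not the method of the notes), the function names are hypothetical, and the search is restricted to $m \geq 0$, which is sufficient in zero field since negative $m$ either has higher energy (odd $p$) or is equivalent by symmetry (even $p$).

```python
import math
import numpy as np

def free_energy_density(m, T, p, J=1.0):
    """Zero-field f(m) of eq. (4.22)."""
    entropy_term = (1 + m) / 2 * np.log((1 + m) / 2) + (1 - m) / 2 * np.log((1 - m) / 2)
    return -J * m**p / math.factorial(p) + T * entropy_term

def equilibrium_magnetization(T, p, J=1.0, n_grid=200_001):
    """Global minimizer of f on a grid of m in [0, 1); m = 1 is excluded to avoid log(0)."""
    m = np.linspace(0.0, 1.0 - 1e-9, n_grid)
    return float(m[np.argmin(free_energy_density(m, T, p, J))])

for T in (0.30, 0.26, 0.24, 0.20):
    print(f"p = 3, T = {T:.2f}: m_eq = {equilibrium_magnetization(T, 3):.4f}")
```

Scanning $T$ downwards, the minimizer stays at $m = 0$ and then jumps directly to a value close to saturation, instead of growing continuously from zero as in the $p = 2$ case.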

Pinning field, broken ergodicity and spontaneous broken symmetry 

The saddle-point equation (4.24) for $p = 2$ [or the mean-field equation (4.16)] admits two
equivalent solutions in zero field. What do they correspond to? They are the magnetization
densities of the equilibrium ferromagnetic states with positive and negative value. Indeed,
if we compute the averaged magnetization density with the partition sum restricted to
the configurations with positive (or negative) value of $x$ we find $m = m_{sp}$. If, for $T < T_c$,
one instead computes $m = N^{-1} \sum_{i=1}^{N} \langle s_i \rangle = Z^{-1} \sum_x e^{-\beta N f(x)} x$ summing over the two
minima of the free-energy density, one finds $m = 0$ as expected by symmetry.

In practice, the restricted sum is performed by applying a small magnetic field, computing
the statistical properties in the $N \to \infty$ limit, and then setting the field to zero.
In other words,

$$m_{\pm} \equiv \frac{1}{N} \sum_{i=1}^{N} \langle s_i \rangle_{\pm} = \left. \left( \frac{1}{\beta N} \frac{\partial \ln Z}{\partial h} \right) \right|_{h \to 0^{\pm}} = - \left. \frac{\partial f(m_{sp})}{\partial h} \right|_{h \to 0^{\pm}} = \pm |m_{sp}| . \tag{4.25}$$



The limit $N \to \infty$ taken in a field selects the positively (or negatively) magnetized states.

Figure 4: Graphical solution to the equation fixing the order parameter $x$ for $p = 2$ (left),
$p = 3$ (center) and $p = 4$ (right) ferromagnetic models at three values of the temperature
$T < T^*$, $T = T^*$ and $T > T^*$ and with no applied field. Note that the rhs of this equation
is antisymmetric with respect to $m \to -m$ for even values of $p$ while it is symmetric under
the same transformation for odd values of $p$. We show the positive quadrant only to
enlarge the figure. $T^*$ is the temperature at which a second minimum appears in the
cases $p = 3$ and $p = 4$.
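The difference between the full and the restricted statistical averages can be checked exactly at finite $N$ for the $p = 2$ model, since the sum over configurations reduces to a sum over the number of up spins with binomial weights. The code below is a sketch under stated assumptions: the function name and the parameter values ($N = 400$, $T = 0.5$, $J = 1$) are illustrative, and the constant term of the energy coming from $i = j$ is dropped since it cancels in the averages.

```python
import math
import numpy as np

def magnetization_averages(N=400, T=0.5, J=1.0):
    """Full and positive-sector averages of x = M/N for the p = 2 model at h = 0."""
    k = np.arange(N + 1)                      # number of up spins
    x = (2 * k - N) / N                       # magnetization density
    log_binom = np.array([math.lgamma(N + 1) - math.lgamma(i + 1) - math.lgamma(N - i + 1)
                          for i in k])
    log_w = log_binom + N * J * x**2 / (2 * T)  # ln(multiplicity) - beta * H(x)
    w = np.exp(log_w - log_w.max())             # rescaled to avoid overflow
    full = float(np.sum(w * x) / np.sum(w))
    positive = x > 0
    restricted = float(np.sum(w[positive] * x[positive]) / np.sum(w[positive]))
    return full, restricted

full, restricted = magnetization_averages()
print(f"full average = {full:.2e}, restricted average = {restricted:.4f}")
```

Below $T_c = J$ the full average vanishes by symmetry while the restricted one is close to the positive solution of $m = \tanh(\beta J m)$, which is the static signature of broken ergodicity discussed in Sect. 4.2.2.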



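The magnetic linear susceptibility is given by

$$\chi \equiv \left. \frac{\partial m}{\partial h} \right|_{h \to 0^{\pm}} = \left. \frac{\partial m_{sp}}{\partial h} \right|_{h \to 0^{\pm}} = \frac{\beta}{\cosh^2\left( \frac{\beta J}{(p-1)!} \, m_{sp}^{p-1} \right) - \frac{\beta J}{(p-2)!} \, m_{sp}^{p-2}} . \tag{4.26}$$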



For even p the two magnetized states have the same divergent susceptibility at T c . 

For odd values of $p$ there is only one non-degenerate minimum of the free-energy density
at low temperatures and the application of a pinning field is then superfluous.

The existence of two degenerate minima of the free-energy density, that correspond to 
the two equilibrium ferromagnetic states at low temperatures, implies that ergodicity is 
broken in these models. In pure static terms this means that one can separate the sum 
over all spin configurations into independent sums over different sectors of phase space 
that correspond to each equilibrium state. In dynamic terms it means that temporal and 
statistical averages (taken over all configurations) do not coincide. 

For any even value of $p$ and at all temperatures the free-energy density in the absence
of the field is symmetric with respect to $m \to -m$, see the left and right panels in Fig. 3.
The phase transition corresponds to a spontaneous symmetry breaking between the states
of positive and negative magnetization. One can determine the one that is chosen when
going through $T_c$ either by applying a small pinning field that is taken to zero only after
the thermodynamic limit, or by imposing adequate boundary conditions. Once a system
settles into one of the equilibrium states this is completely stable in the $N \to \infty$ limit.

For all odd values of p the phase transition is not associated to symmetry breaking, since 
there is only one non-degenerate minimum of the free-energy density that corresponds to 
the equilibrium state at low temperature. 



4.5.3 Landau theory 

The exercise that we solved in the last subsection corresponds to a fully solvable model
for which mean-field theory is exact. Now, can one get an idea of the limit of validity of
mean-field theory and when it is expected to fail?

Landau proposed an extension of Weiss mean-field theory for ferromagnets (Sect. 4.5)
that has a much wider range of application, includes space, and allows one to predict when it
applies and when it fails. In a few words (we shall not develop Landau theory here) it
consists in proposing a field theory for coarse-grained fields that represent the averaged
relevant variable. In the case of an Ising spin system, the field in each coarse-graining
volume $v = \ell^d$ within the sample is defined as

$$\phi(\vec r) \equiv \frac{1}{\ell^d} \sum_{i \in v_{\vec r}} s_i , \tag{4.27}$$

and the effective free-energy of the interacting system is proposed to be

$$F(\phi) = \int d^d r \left[ c \, (\vec\nabla \phi(\vec r))^2 + \lambda \, \phi^4(\vec r) + \frac{T - T_c}{T_c} \, \phi^2(\vec r) \right] . \tag{4.28}$$



The first term mimics an elastic energy related to the ferromagnetic interactions. The
second term is an expansion of the entropic contribution in powers of $T - T_c$ that is
expected to be valid only close to $T_c$. One keeps the leading terms in the Taylor expansion.
Note that this 'order-parameter dependent' free-energy is not quadratic due to the term
$\phi^4$. If one then integrates over all configurations, to compute the partition function,
and then evaluates this integral with a saddle-point approximation, one can also include
the fluctuations (see Appendix 3) and see when these become too important and
thus make the saddle-point evaluation invalid. This analysis is called the Ginzburg criterion
and tells us that there is an upper critical dimension,

$$d_u = 4 \tag{4.29}$$

above which mean-field theory is exact and below which it fails. However, it does not fail
everywhere in parameter space. It just fails when one gets very close to the critical point.
How close depends on the system, and this range is called the critical region. The behavior
of the system is then well-described by the Landau-Ginzburg theory away from the critical region.
Inside the critical region the approach fails. In particular, it predicts the mean-field
exponents in (4.34) for all $d$.

Landau and Ginzburg got Nobel Prizes in Physics that were strongly associated to 
their work along these lines. 



4.6 Critical exponents 

The rest of the discussion will focus on second order phase transitions for which the order
parameter departs from zero smoothly when entering the ordered phase.

When studying the observables close to the critical point one realizes that they depend
on the distance from the critical point in the form of power laws. For instance, in zero
field the order parameter increases as

$$m \sim (T_c - T)^{\beta} \tag{4.30}$$

while at $T_c$ and as a function of the conjugate field it behaves as

$$m \sim h^{1/\delta} . \tag{4.31}$$

The divergence of the linear susceptibility at $T_c$ is characterized by two exponents

$$\chi \sim \begin{cases} (T - T_c)^{-\gamma} & T > T_c \\ (T_c - T)^{-\gamma'} & T < T_c \end{cases} \tag{4.32}$$

The specific heat also diverges at $T_c$:

$$C_v \sim \begin{cases} (T - T_c)^{-\alpha} & T > T_c \\ (T_c - T)^{-\alpha'} & T < T_c \end{cases} \tag{4.33}$$



While the values of $T_c$ are material dependent, all ferromagnetic transitions of systems
in $d = 3$ with an order parameter of the same dimensionality can be described by
the same critical exponents, within error bars! This feature indicates the existence
of universality classes, i.e. groups of systems for which the details of the microscopic
interactions do not matter and whose macroscopic critical behavior is identical.

It is simple to compute the exponents in the naive mean-field approximation for any $d$
or for the fully connected model with $p = 2$. They read

$$\alpha = 0 , \quad \beta = \frac{1}{2} , \quad \gamma = 1 , \quad \delta = 3 , \quad \eta = 0 , \quad \nu = \frac{1}{2} , \tag{4.34}$$



and they are independent of $d$. These values are to be compared with the experimental
values. In ferromagnetic phase transitions of Ising symmetry they are

    d     β       α      γ      δ      ν      η
    2     1/8     0      7/4    15     1      1/4      exact
    3     0.325   0.11   1.24   4.82   0.63   0.032    approx

Table 2: Critical exponents in the Ising universality class.



4.7 Towards an understanding of critical phenomena 

The fact that very different systems share the same critical properties, the mere exis- 
tence of universality classes, suggested that a very general framework should be able to 
describe the critical behavior of all these systems at once. The fact that the mean-field 
critical exponents were slightly different from the observed ones was not very important 
as a quantitative disagreement but it was from a fundamental point of view. Something 
important was going on and needed an explanation. 

In this Subsection we introduce and discuss the concepts that allowed a qualitative 
and quantitative understanding of critical phenomena. The ideas and methods introduced 
actually go beyond this problem and have been exported to other situations like dynamical 
processes in and out of equilibrium (see Sects. 6 and 7). 

4.7.1 Scaling 

We have defined 6 critical exponents ($\alpha$ for the specific heat, $\beta$ for the order parameter,
$\gamma$ for the susceptibility, $\delta$ for the order parameter at the critical point as a function
of the conjugate field, $\eta$ for the correlation function and $\nu$ for the correlation length).
But, actually, not all these exponents are independent: the scaling hypothesis allows one
to show that only two of them are. One example of these relations is the one called
Rushbrooke scaling law

$$\alpha + 2\beta + \gamma = 2 . \tag{4.35}$$

These relations follow from the fact that all observables can be computed from the free-energy
density and, if one assumes a scaling form for it, one is forced to have relations
between the exponents characterizing the thermodynamic observables and correlation
functions. We shall not discuss these relations here; the interested student can find them
in any book on critical phenomena, see e.g. [9].
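As a quick consistency check, the Rushbrooke relation (4.35) can be evaluated with the exponent values quoted in eq. (4.34) and Table 2. This is only a sketch; the dictionary layout and the function name `rushbrooke_sum` are illustrative choices, and the $d = 3$ values are approximate so the relation is only expected to hold within their accuracy.

```python
from fractions import Fraction

# Exponent values taken from eq. (4.34) and Table 2 of these notes.
exponents = {
    "mean field": {"alpha": Fraction(0), "beta": Fraction(1, 2), "gamma": Fraction(1)},
    "Ising d=2 (exact)": {"alpha": Fraction(0), "beta": Fraction(1, 8), "gamma": Fraction(7, 4)},
    "Ising d=3 (approx)": {"alpha": 0.11, "beta": 0.325, "gamma": 1.24},
}

def rushbrooke_sum(e):
    """Left-hand side of eq. (4.35): alpha + 2 beta + gamma."""
    return e["alpha"] + 2 * e["beta"] + e["gamma"]

for name, e in exponents.items():
    print(f"{name}: alpha + 2 beta + gamma = {float(rushbrooke_sum(e)):.3f}")
```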

Let us take a simple viewpoint and discuss a way to collapse data close to a critical
point, a property closely related to scaling. The power law expressions (4.30) and (4.31)
suggest Widom scaling for the order parameter:

$$m(t, h) \sim |t|^{\beta} \, \Phi_{\pm}\left( \frac{h}{|t|^{\beta\delta}} \right) , \qquad t \equiv \frac{T - T_c}{T_c} , \tag{4.36}$$

with $\Phi_{\pm}(0) = 1$ and $\Phi_{\pm}(x \to \infty) \sim x^{1/\delta}$. With these limits, (4.31) is recovered on the
critical isotherm and (4.30) follows at strictly zero field and for $|t| \ll 1$. An example of
data collapse is given in Fig. 5.

Surprisingly enough, all systems undergoing a ferromagnetic transition can be scaled
in this way using the same functions $\Phi_{\pm}$ above and below the critical temperature,
respectively! The way of checking this hypothesis is by plotting $m/|t|^{\beta}$ against $h/|t|^{\beta\delta}$ for
different systems and looking for data collapse. Of course, we do not know the values of
the universal exponents $\beta$ and $\delta$ and the material dependent critical temperature $T_c$ a
priori, so we need to manipulate the data a bit before obtaining collapse. Note that the
scaling law (4.36) is independent of the dimension $d$.
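Data collapse can be tried out on synthetic data. The sketch below generates magnetizations from the $p = 2$ mean-field equation of state, $m = \tanh[(Jm + h)/T]$ with $T_c = J = 1$, and rescales them with the mean-field exponents $\beta = 1/2$, $\delta = 3$. Everything here is an assumption made for illustration (function names, bisection solver, chosen values of $t$ and of the scaled field); real data would require fitting $T_c$, $\beta$ and $\delta$.

```python
import math

def equation_of_state(T, h, J=1.0):
    """Positive root of m = tanh((J m + h) / T) for h > 0, found by bisection on [0, 1]."""
    lo, hi = 0.0, 1.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid - math.tanh((J * mid + h) / T) < 0.0:
            lo = mid   # root lies above mid
        else:
            hi = mid   # root lies below mid
    return 0.5 * (lo + hi)

def scaled_point(t, y, beta=0.5, delta=3.0, T_c=1.0):
    """Return m / |t|^beta at fixed scaled field y = h / |t|^(beta * delta)."""
    h = y * abs(t) ** (beta * delta)
    T = T_c * (1.0 + t)
    return equation_of_state(T, h) / abs(t) ** beta

for t in (0.005, 0.01, 0.02):
    print(t, [round(scaled_point(t, y), 4) for y in (0.5, 1.0, 2.0)])
```

For each value of the scaled field the rescaled magnetizations obtained at different $t$ nearly coincide, with differences that shrink as $|t| \to 0$, which is what one means by data collapse.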

One can also write the scaling hypothesis for the free-energy density and derive the 
one for the magnetization by differentiation. 

Scaling relations that involve the dimension are called hyperscaling. The correlation
function satisfies

$$G(r; t, h) \sim r^{-(d-2+\eta)} \, g\left( r |t|^{\nu} , \frac{h}{|t|^{\beta\delta}} \right) . \tag{4.37}$$

Kadanoff proposed that this quite incredible feature could be explained assuming that 
near a critical point a system looks the same at all length scales. This is called scale 
invariance. 

4.7.2 The correlation length 

A very important concept in critical phenomena is that of a correlation length, usually
denoted by $\xi$. Quantitatively, it describes the extent of fluctuations in a physical quantity
about its average. At finite temperature, one can have droplets of the wrong phase within
the correct one, due to thermal agitation. The size of these droplets will be a function of
temperature and at a given instant, a snapshot of the system would reveal the existence
of a number of them with different sizes. One expects though that they have a well-defined
average (taken, for instance, over different snapshots taken at different times).

Fig. 4.4.3. Schematic phase boundary for the liquid-gas transition showing
the asymmetry of the order parameter about the critical density.

Fig. 4.4.4. Phase boundary in units of reduced temperature and density for
eight different molecular fluids near their liquid-gas transitions. Note the
universal behavior and the fact that the solid line is $\Delta\phi \propto (T_c - T)^{\beta}$ with
$\beta = 1/3$ rather than the mean-field result $\beta = 1/2$. [E. A. Guggenheim, J.
Chem. Phys. 13, 253 (1945)]

Figure 5: Critical scaling in gas-liquid transitions at constant pressure. At very low
density and low temperature, at the left of the curve, the system is a gas; at very large
density and still low temperature, at the right of the curve, the system is a liquid. In the
region below the curve there is coexistence of gas and liquid. Above the curve the system
goes continuously from a gas to a liquid when increasing the density. The critical line
behaves as $|\rho_l - \rho_g| \sim |T - T_c|^{\beta}$ with $\beta \simeq 0.327$ close to the maximum. Note that scaling
holds as far as $T/T_c \simeq 0.55$!



This average size can be taken as a qualitative indication of the value of the correlation 
length (we shall give a more precise definition below) and one expects it to be very short, 
of the order of the lattice spacing, at very low temperature, to increase when approaching 
$T_c$, actually to diverge at $T_c$, and then to decrease again in the high temperature phase 
when getting away from the critical point. See Fig. 6. 

The fact that one finds coherent structures at all lengths at the critical point means that 
there is no characteristic spatial scale left in the problem and all scales participate in the 
critical behaviour. The system becomes scale invariant and, if one looks at it with microscopes 
of different magnification, one sees essentially the same picture. 

The actual definition of the correlation length is based on the use of the static 
susceptibility sum rule. Let us define the two-point connected correlation function 

$$ G(\vec r_i, \vec r_j) = \langle s_i s_j \rangle - \langle s_i \rangle \langle s_j \rangle \tag{4.38} $$

where $\vec r_j$ is the spatial position of the spin $s_j$. A simple calculation allows one to show 
that the linear susceptibility reads 

$$ \chi \equiv \left. \frac{\partial m_h}{\partial h} \right|_{h=0} = - \left. \frac{\partial^2 f}{\partial h^2} \right|_{h=0} = \frac{\beta}{N} \sum_{ij} G(\vec r_i, \vec r_j) . \tag{4.39} $$



Let us explore the consequences of this relation. Nothing indicates that spatial translational 
invariance should be violated, thus, the correlation should be a function of the distance 
between the points $\vec r_i$ and $\vec r_j$ only: 

$$ G(\vec r_i, \vec r_j) = G(\vec r_i - \vec r_j) . \tag{4.40} $$

One then has 

$$ \chi = \frac{\beta}{N} \sum_{ij} G(\vec r_{ij}) = \beta \sum_{\vec r} G(\vec r) \simeq \beta \int_V d^d r \; G(\vec r) . \tag{4.41} $$

This means that the divergence of the susceptibility at the critical point must be 
accompanied by a special behavior of the correlation function. Indeed, one finds that 

$$ G(r) \sim \frac{e^{-r/\xi}}{\xi^{(d-3)/2} \, r^{(d-1)/2}} , \qquad \text{for } r \gg \xi . \tag{4.42} $$

This expression is integrable over the full volume unless the exponential factor disappears. 
This is indeed what happens at $T_c$ where the correlation length diverges, again as a power 
law of the distance to the critical point 

$$ \xi \sim |T - T_c|^{-\nu} . \tag{4.43} $$

Finally, one observes that the correlation function right at the critical point decays 
as a power law: 

$$ G(r) \sim r^{-(d-2+\eta)} , \tag{4.44} $$

with $\eta$ another critical exponent. 

Let us discuss the correlation length in a simple solvable case, the Ising model in $d = 1$ 
with, say, open boundary conditions. In this case, the finite temperature correlation 
function is 

$$ G_{kl} = \langle s_k s_l \rangle - \langle s_k \rangle \langle s_l \rangle = \langle s_k s_l \rangle \tag{4.45} $$

since $\langle s_k \rangle = 0$ at any $T > 0$. Introducing, for convenience, different coupling constants 
$K_i = \beta J_i$ on the links, $G_{kl}$ reads (for $k < l$) 

$$ G_{kl} = Z^{-1} \sum_{\{s_i = \pm 1\}} s_k s_l \; e^{\sum_i K_i s_i s_{i+1}} = Z^{-1} \frac{\partial}{\partial K_k} \frac{\partial}{\partial K_{k+1}} \cdots \frac{\partial}{\partial K_{l-1}} Z . \tag{4.46} $$

At the end of the calculation one takes $K_i = K = \beta J$ for all $i$. Thus, at finite temperature 
the connected correlation between any two spins can be computed as a number of derivatives 
(depending on the distance between the spins) of the partition function conveniently 
normalized. Using the change of variables $\eta_i = s_i s_{i+1}$, one finds 

$$ Z = \sum_{s_1 = \pm 1} \sum_{\{\eta_i = \pm 1\}} e^{\sum_{i=1}^{N-1} K_i \eta_i} = 2 \prod_{i=1}^{N-1} 2\cosh(K_i) \to 2 \, (2 \cosh \beta J)^{N-1} . \tag{4.47} $$

Taking the distance between the chosen spins $s_k$ and $s_l$ to be $|k - l| = r$ the correlation 
function is then given by 

$$ G(r) = [\tanh(\beta J)]^r = e^{r \ln[\tanh(\beta J)]} = e^{-r/\xi} \tag{4.48} $$

with 

$$ \xi = - \frac{1}{\ln[\tanh(\beta J)]} \simeq \frac{1}{2} \, e^{2\beta J} \quad \text{for } \beta J \gg 1 . \tag{4.49} $$

In this one dimensional example we found an essential singularity, an exponential 
divergence, of the correlation length when approaching $T_c = 0$. In general, in higher $d$, one has 
a power law divergence of the form (4.43). 
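
The result $G(r) = [\tanh(\beta J)]^r$ is easy to check numerically. The following is a minimal sketch, assuming a short open chain with uniform coupling $K = \beta J$; the function names are chosen here for illustration only. It compares a brute-force enumeration of $\langle s_k s_l \rangle$ with the closed form, and the correlation length $\xi = -1/\ln\tanh K$ with its low temperature estimate $e^{2K}/2$.

```python
import itertools
import math


def correlation_open_chain(N, K, k, l):
    """<s_k s_l> for an open Ising chain of N spins, by explicit enumeration.

    K = beta * J is the uniform reduced coupling; k and l are 0-based sites.
    """
    Z = 0.0
    numerator = 0.0
    for s in itertools.product((-1, 1), repeat=N):
        weight = math.exp(K * sum(s[i] * s[i + 1] for i in range(N - 1)))
        Z += weight
        numerator += s[k] * s[l] * weight
    return numerator / Z


def correlation_length(K):
    """xi = -1 / ln tanh(K), in units of the lattice spacing."""
    return -1.0 / math.log(math.tanh(K))


if __name__ == "__main__":
    N, K = 10, 0.8
    for r in range(1, 6):
        print(r, correlation_open_chain(N, K, 2, 2 + r), math.tanh(K) ** r)
    # Exponential growth of xi as T -> 0 (K -> infinity)
    for K in (0.5, 1.0, 2.0, 3.0):
        print(K, correlation_length(K), 0.5 * math.exp(2 * K))
```

The enumeration costs $2^N$ operations, so it is only meant as a check on very small chains.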



4.7.3 Finite size effects 

A real system is large but finite, $1 \ll N_A \sim 10^{23} < \infty$. Finite size effects will then play a 
role in the phase transition, which is rounded by the fact that $N_A < \infty$. Finite size effects 
become important when $\xi \sim L$, the linear size of the system, say $L \sim 1$ cm for an actual 
sample. A rough estimate of how close to $T_c$ one needs to get to see deviations from 
critical scaling shows that finite size effects are quite negligible in experiments but are 
certainly not in numerical simulations and have to be taken into account very carefully 
when trying to compare numerical data to analytical predictions. 
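
The rough estimate can be made explicit. The sketch below assumes $\xi \simeq a \, |t|^{-\nu}$ with $t = (T - T_c)/T_c$, an amplitude equal to one lattice spacing $a \simeq 1$ Å, and $\nu \simeq 0.63$ (the three dimensional Ising value); these numbers are illustrative assumptions, not taken from the text. It returns the reduced temperature at which $\xi$ reaches the linear size $L$.

```python
def reduced_temperature_at_crossover(L_over_a, nu):
    """|t| at which xi = a |t|^(-nu) equals the linear size L (in units of a)."""
    return L_over_a ** (-1.0 / nu)


nu = 0.63  # assumption: 3d Ising correlation length exponent

# Experiment: L ~ 1 cm and a ~ 1 angstrom, i.e. L/a ~ 1e8
t_experiment = reduced_temperature_at_crossover(1e8, nu)

# Simulation: a lattice of linear size L = 100 lattice spacings
t_simulation = reduced_temperature_at_crossover(1e2, nu)

print("experiment: |t| ~", t_experiment)
print("simulation: |t| ~", t_simulation)
```

Under these assumptions a sample of 1 cm has to be tuned to within roughly one part in $10^{12}$ of $T_c$ before rounding sets in, while a simulation of linear size 100 is affected already at $|t|$ of order $10^{-3}$.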

Finite size effects are taken into account by introducing correcting functions in the 
scaling laws, for example, 

$$ \chi \sim |T - T_c|^{-\gamma} \, g(L/\xi) \tag{4.50} $$

with $g(x \to \infty) \to 1$ and $g(x \to 0) \to x^{\gamma/\nu}$, and 

$$ m \sim L^{-\beta/\nu} . \tag{4.51} $$

4.7.4 The renormalization group 

The development of the renormalization group (RG) by K. Wilson in the early 70s gave a 
totally new way of understanding condensed-matter and particle physics phenomena. 
He transformed the picture of phase transitions that developed in the 60s - with the 
understanding of concepts like scaling, universality and correlations - into a calculational 
tool, and was awarded the Nobel Prize in Physics in 1982. 

Let us describe how the RG works on one example, again the one dimensional Ising 
model with $N$ (even) spins in the absence of an applied field. The partition function reads 

$$ Z(N, J) = \sum_{\{s_i = \pm 1\}} e^{\beta J \sum_{i=1}^{N} s_i s_{i+1}} \tag{4.52} $$

(with $s_{N+1} \equiv s_1$). We shall call $K = \beta J$. The sum in the exponential is 

$$ s_1 s_2 + s_2 s_3 + s_3 s_4 + s_4 s_5 + \dots \tag{4.53} $$

It is clear that $s_2$ enters only in the first two terms, $s_4$ enters only in the third and fourth 
terms and so on and so forth. One can then sum over all configurations of the spins with 
an even label. For example, the sum over $s_2 = \pm 1$ yields 

$$ e^{K(s_1 + s_3)} + e^{-K(s_1 + s_3)} . \tag{4.54} $$

One then obtains: 

$$ Z(N, K) = \sum_{\{s_i = \pm 1; \, i \ {\rm odd}\}} \; \prod_{i=1, \, i \ {\rm odd}}^{N-1} \left[ e^{K(s_i + s_{i+2})} + e^{-K(s_i + s_{i+2})} \right] . \tag{4.55} $$

If we find a function $k(K)$ and a new coupling constant $K'$ such that each of these terms 
can be written as 

$$ e^{K(s_i + s_{i+2})} + e^{-K(s_i + s_{i+2})} = k(K) \, e^{K' s_i s_{i+2}} \tag{4.56} $$

the right-hand-side in eq. (4.55) would be proportional to the partition function of another 
one dimensional Ising model with $N/2$ spins and a different coupling constant $K'$: 

$$ Z(N, K) = k(K)^{N/2} \, Z(N/2, K') . \tag{4.57} $$

Indeed, the solution to (4.56) can be easily found; it is enough to consider all the cases 
$s_i = \pm 1$ and $s_{i+2} = \pm 1$ to obtain: 

$$ K' = \frac{1}{2} \ln \cosh(2K) , \tag{4.58} $$
$$ k(K) = 2 \cosh^{1/2}(2K) . \tag{4.59} $$
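
For completeness, here is the short computation behind (4.58) and (4.59), written out under the only assumption that the decimation condition reads $e^{K(s_i + s_{i+2})} + e^{-K(s_i + s_{i+2})} = k(K) \, e^{K' s_i s_{i+2}}$. Since both spins take the values $\pm 1$ there are just two distinct cases.

```latex
% Case s_i = s_{i+2}: the left-hand side is e^{2K} + e^{-2K}
2\cosh(2K) = k(K)\, e^{K'} ,
% Case s_i = -s_{i+2}: the left-hand side is 1 + 1
2 = k(K)\, e^{-K'} .
% Taking the ratio and the product of the two relations:
e^{2K'} = \cosh(2K) \;\Rightarrow\; K' = \tfrac{1}{2}\ln\cosh(2K) ,
\qquad
k(K)^2 = 4\cosh(2K) \;\Rightarrow\; k(K) = 2\cosh^{1/2}(2K) .
```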
Now, we know that the free-energy, and in particular $\ln Z$, should be linear in $N$: 

$$ -\beta F(N, K) = \ln Z(N, K) = N \zeta(K) . \tag{4.60} $$

Thus, taking the $\ln$ of eq. (4.57) and using (4.60) 

$$ \ln Z(N, K) = \frac{N}{2} \ln k(K) + \ln Z(N/2, K') , \tag{4.61} $$
$$ \zeta(K) = \frac{1}{2} \ln k(K) + \frac{1}{2} \zeta(K') , \tag{4.62} $$

or equivalently 

$$ \zeta(K') = 2 \zeta(K) - \ln\left[ 2 \cosh^{1/2}(2K) \right] . \tag{4.63} $$
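
The decimation relations can be checked numerically. The sketch below is only an illustration (function names are chosen here, and periodic boundary conditions are assumed so that the decimation is exact on a finite ring). It first verifies $Z(N, K) = k(K)^{N/2} Z(N/2, K')$ by brute-force enumeration, and then iterates the recursion towards increasing coupling: starting from a very weak coupling $K'$, where $\zeta(K') \simeq \ln 2$, it uses the inverse of (4.58), $K = \frac{1}{2} \cosh^{-1}(e^{2K'})$, together with (4.62) written as $\zeta(K) = \frac{1}{2}\ln 2 + \frac{1}{2} K' + \frac{1}{2}\zeta(K')$. The result is compared with the exact $\zeta(K) = \ln(2\cosh K)$ of the infinite chain.

```python
import itertools
import math


def Z_brute(N, K):
    """Partition function of a periodic Ising chain by explicit enumeration."""
    total = 0.0
    for s in itertools.product((-1, 1), repeat=N):
        bonds = sum(s[i] * s[(i + 1) % N] for i in range(N))
        total += math.exp(K * bonds)
    return total


def decimate(K):
    """One decimation step: returns (K', k(K)) as in eqs. (4.58)-(4.59)."""
    cosh2K = math.cosh(2.0 * K)
    return 0.5 * math.log(cosh2K), 2.0 * math.sqrt(cosh2K)


def zeta_by_flow(K_start=1e-4, K_target=1.0):
    """Iterate the inverse recursion from weak coupling up to K >= K_target.

    Returns the list of visited (K, zeta) pairs; zeta = ln(Z)/N.
    """
    K, zeta = K_start, math.log(2.0)  # independent spins at weak coupling
    history = [(K, zeta)]
    while K < K_target:
        K_prime, zeta_prime = K, zeta
        K = 0.5 * math.acosh(math.exp(2.0 * K_prime))
        zeta = 0.5 * math.log(2.0) + 0.5 * K_prime + 0.5 * zeta_prime
        history.append((K, zeta))
    return history


if __name__ == "__main__":
    K = 0.7
    K_prime, k = decimate(K)
    print(Z_brute(8, K), k ** 4 * Z_brute(4, K_prime))
    for K_i, zeta_i in zeta_by_flow():
        print(K_i, zeta_i, math.log(2.0 * math.cosh(K_i)))
```

The error made by the initial guess $\zeta = \ln 2$ is halved at each step, which is why the iteration reproduces the exact free-energy so accurately.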



Inserting a value of $K$ on the right-hand-side one obtains the new coupling constant $K'$ 
and the new free-energy $\zeta(K')$. 

Equations (4.58) and (4.63) provide recursion relations for the coupling constant and 
the partition function. Note that $K'$ is always smaller than $K$. One can also solve for $K$ 
as a function of $K'$: 

$$ K = \frac{1}{2} \cosh^{-1}\left( e^{2K'} \right) , \tag{4.64} $$
$$ \zeta(K) = \frac{1}{2} \ln 2 + \frac{1}{2} K' + \frac{1}{2} \zeta(K') , \tag{4.65} $$

obtaining now an increasing flow $K(K')$. 

One can use these results to compute the value of the partition function for any $K$. 
The argument goes as follows. For very small $K'$, i.e. very high temperature, the spins 
are basically independent and $Z(K') \simeq 2^N$ and $\zeta(K') = \ln 2$. Using then (4.64) and (4.65) 
one computes $K$ and $\zeta(K)$. One then iterates using these values as starting points $K'$ 
and $\zeta(K')$. The agreement between the values generated this way and the results of the 
exact calculation is quite amazing (see, e.g. [13]). 

The process described in the previous paragraph is a flow in the space of parameters. 
Starting from any non-zero value of the coupling constant the iteration converges to 
$K \to \infty$. There are then two fixed points, a stable fixed point at $K \to \infty$ (to which 
trajectories are attracted) and an unstable fixed point $K = 0$ from which trajectories 
depart. The stable fixed point is the critical point one is looking for. The critical behavior 
can then be obtained from the dependence of $\zeta$ on the parameter $K - K_c$. 

A similar procedure can be applied to the $d = 2$ problem. In this case, however, the 
decimation of spins cannot be done exactly, as in $d = 1$, and one is forced to use some 
approximation. There are a number of successful recipes in the literature. Basically, one finds 
that the flow in parameter space approaches a stable fixed point at a finite value of $K$ which 
corresponds to the phase transition. 

4.7.5 Fluctuations of macroscopic observables 

A direct consequence of having a diverging correlation length is that the critical 
measurement-to-measurement fluctuations of global observables like, for instance, the 
magnetization density, are not Gaussian. The reason is simple: if $L < \xi$, a global measurement 
is not the result of an average over many uncorrelated regions and, thus, one cannot use the 
central limit theorem to argue for a normal distribution of fluctuations. Recently, the study of 
critical fluctuations of macroscopic observables has received much attention. The best adapted 
model for this analysis is the 2d XY model, which is critical on a finite interval of temperatures 
(and not only at a single precise value of $T_c$) [27]. This model, in an approximation, is 
mapped onto an interface model, the Edwards-Wilkinson one, that we shall discuss in 
Sect. 7. 

In the limit we are now interested in, $L$ is finite with respect to $\xi$ and finite size effects 
are important. Finite size scaling implies 

(4.66) 

The calculation of this probability function in the so-called spin-wave approximation of 
the 2d XY model showed that it has the form of the Gumbel distributions found in the study 
of extreme value statistics that we discussed in Sect. 3.1.2. The role played by extreme 
values in this context remains, however, not clear. We shall give more details on the analysis 
of these non-Gaussian fluctuations in the language of interfaces. 

5 Disordered systems 

No material is perfectly homogeneous: impurities of different kinds are distributed ran- 
domly throughout the samples. 

A natural effect of disorder should be to lower the critical temperature. Much attention 
has been paid to the effect of weak disorder on phase transitions, that is to say, 
situations in which the nature of the ordered and disordered phases is not modified by 
the impurities but the critical phenomenon is. On the one hand, the critical exponents of 
second order phase transitions might be modified by disorder, on the other hand, disorder 
may smooth out the discontinuities of first order phase transitions rendering them of sec- 
ond order. Strong disorder instead changes the nature of the low-temperature phase and 
before discussing the critical phenomenon one needs to understand how to characterize 
the new ordered 'glassy' phase. 

In this Chapter we shall discuss several types of quenched disorder and models that 
account for it. We shall also overview some of the theoretical methods used to deal with 
the static properties of models with quenched disorder. 

5.1 Quenched and annealed disorder 

First, one has to distinguish between quenched and annealed disorder. Imagine that one 
mixes some random impurities in a melt and then very slowly cools it down in such a way 
that the impurities and the host remain in thermal equilibrium. If one wants to study 
the statistical properties of the full system one then has to compute the full partition 
function in which one sums over all configurations of original components and impurities. 
This is called annealed disorder. In the opposite case in which upon cooling the host 
and impurities do not equilibrate but the impurities remain blocked in random fixed 
positions, one talks about quenched disorder. Basically, the relaxation time associated 
with the diffusion of the impurities in the sample is so long that these remain trapped. 
The former case is easier to treat analytically but is less physically relevant. The latter 
is the one that leads to new phenomena and ideas that we shall discuss next. 

5.2 Bond disorder: the case of spin-glasses 

Spin-glasses are alloys in which magnetic impurities substitute the original atoms in 
positions randomly selected during the chemical preparation of the sample. The interactions 
between the impurities are of RKKY type: 

$$ V_{\rm RKKY} = - J \, \frac{\cos(2 k_F r_{ij})}{r_{ij}^3} \, s_i s_j \tag{5.1} $$

with $r_{ij} = |\vec r_i - \vec r_j|$ the distance between them and $s_i$ a spin variable that represents their 
magnetic moment. Clearly, the initial location of the impurities varies from sample to 
sample. The time-scale for diffusion of the magnetic impurities is much longer than the 
time-scale for spin flips. Thus, for all practical purposes the positions $\vec r_i$ can be associated 
to quenched random variables distributed according to a uniform probability distribution 
that in turn implies a probability distribution of the exchanges. This is called quenched 
disorder. 

Figure 7: A frustrated (left) and an unfrustrated (center) square plaquette. A frustrated 
triangular plaquette (right). 



5.2.1 Frustration 

Depending on the value of the distance $r_{ij}$ the numerator in eqn. (5.1) can be positive or 
negative implying that both ferromagnetic and antiferromagnetic interactions exist. This 
leads to frustration, which means that some two-body interactions cannot be satisfied by 
any spin configuration. An example with four sites and four links is shown in Fig. 7-left, 
where we took three positive exchanges and one negative one all, for simplicity, with the 
same absolute value, $J$. Eight configurations (four up to a global spin flip) minimize the 
energy, $E_f = -2J$, and each of them leaves one of the four links unsatisfied. One can easily 
check that any closed loop such that the product of the interactions takes a negative sign 
is frustrated. Frustration naturally leads to a higher energy and a larger degeneracy of the 
ground state. This is again easy to grasp by comparing the number of ground states of the 
frustrated plaquette in Fig. 7-left to its unfrustrated counterpart shown on the central panel. 
Indeed, the energy and degeneracy of the ground state of the unfrustrated plaquette are 
$E_u = -4J$ and $n_u = 2$, respectively. 

Frustration may be due to pure geometrical constraints. The canonical example is an 
antiferromagnet on a triangular lattice in which each plaquette is frustrated, see Fig. 7. 
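
The ground-state energies and degeneracies of such small plaquettes can be obtained by direct enumeration. A minimal sketch follows, with $J = 1$ and a site labelling chosen here for illustration; both members of each spin-flip pair are counted as separate configurations.

```python
import itertools


def ground_states(bonds, n_sites):
    """Minimum of E = -sum_(ij) J_ij s_i s_j and the configurations reaching it.

    bonds is a list of (i, j, J_ij) triples with integer couplings.
    """
    best_energy, best = None, []
    for s in itertools.product((-1, 1), repeat=n_sites):
        energy = -sum(J * s[i] * s[j] for i, j, J in bonds)
        if best_energy is None or energy < best_energy:
            best_energy, best = energy, [s]
        elif energy == best_energy:
            best.append(s)
    return best_energy, best


J = 1
square = [(0, 1), (1, 2), (2, 3), (3, 0)]
frustrated = [(i, j, J) for i, j in square[:3]] + [(3, 0, -J)]  # one negative bond
unfrustrated = [(i, j, J) for i, j in square]
triangle_afm = [(0, 1, -J), (1, 2, -J), (2, 0, -J)]  # antiferromagnetic triangle

for name, bonds, n in (("frustrated square", frustrated, 4),
                       ("unfrustrated square", unfrustrated, 4),
                       ("antiferromagnetic triangle", triangle_afm, 3)):
    energy, states = ground_states(bonds, n)
    print(name, "E_min =", energy, "degeneracy =", len(states))
```

The frustrated plaquette has a higher ground-state energy and a larger degeneracy than the unfrustrated one, as stated in the text.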



5.2.2 Self-averageness 

If each sample is characterized by its own realization of the exchanges, should one expect 
a totally different behavior from sample to sample? Fortunately, many generic static and 
dynamic properties of spin-glasses (and other systems with quenched disorder) do not 
depend on the specific realization of the random couplings and are self-averaging. This 
means that the typical value is equal to the average over the disorder: 

$$ A_J^{\rm typ} = [A_J]_{\rm av} . \tag{5.2} $$

In particular, the free-energy of models with short-ranged interactions is expected to be 
self-averaging. 

The meaning of this property can be grasped from the solution of the random bond Ising 
chain defined by the energy function $E = -\sum_i J_i s_i s_{i+1}$ with spin variables $s_i = \pm 1$, for 
$i = 1, \dots, N$ and random bonds $J_i$ independently taken from a probability distribution 
$P(J_i)$. For simplicity, we consider periodic boundary conditions. The disorder-dependent 
partition function reads 

$$ Z_J = \sum_{\{s_i = \pm 1\}} e^{\beta \sum_i J_i s_i s_{i+1}} \tag{5.3} $$

and this can be readily computed introducing the change of variables $\sigma_i = s_i s_{i+1}$. One 
finds, up to a boundary correction that is negligible for large $N$, 

$$ Z_J = \prod_i 2 \cosh(\beta J_i) \quad \Rightarrow \quad -\beta F_J = \sum_i \ln \cosh(\beta J_i) + N \ln 2 . \tag{5.4} $$

The partition function is a product of i.i.d. random variables and it is itself a random 
variable with a log-normal distribution. The free-energy instead is a sum of i.i.d. random 
variables and, using the central limit theorem, it becomes a Gaussian random variable that 
is narrowly peaked about its maximum in the large $N$ limit. The typical value, determined 
by the maximum of the distribution, coincides with the average. 
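
Self-averaging of the free-energy density of the random bond chain can be observed directly by sampling. The sketch below assumes Gaussian bonds with zero mean and unit variance and $\beta = 1$ (illustrative choices, not specified in the text) and uses $\ln Z_J = \sum_i \ln[2\cosh(\beta J_i)]$. The sample-to-sample standard deviation of $\ln Z_J / N$ should shrink as $N^{-1/2}$ while its mean stays put.

```python
import math
import random
import statistics


def log_Z(couplings, beta):
    """ln Z_J = sum_i ln[2 cosh(beta J_i)] for one realization of the bonds."""
    return sum(math.log(2.0 * math.cosh(beta * J)) for J in couplings)


def sample_log_Z(N, beta, n_samples, rng):
    """ln Z_J for n_samples independent realizations of N Gaussian bonds."""
    return [log_Z([rng.gauss(0.0, 1.0) for _ in range(N)], beta)
            for _ in range(n_samples)]


if __name__ == "__main__":
    rng = random.Random(0)
    beta = 1.0
    for N in (100, 400, 1600):
        density = [x / N for x in sample_log_Z(N, beta, 200, rng)]
        print(N, statistics.mean(density), statistics.stdev(density))
```

Multiplying $N$ by four should roughly halve the printed standard deviation.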

A simple argument that justifies the self-averageness of the free-energy in generic finite 
dimensional systems with short-range interactions is the following. Let us divide the 
system of volume $V$ in $n$ subsystems of volume $v$ with $V = nv$. If the interactions are 
short-ranged, the total energy is the sum of two terms, a contribution from the bulk of 
the subsystems and a contribution from the interfaces between the subsystems. In the 
thermodynamic limit, the latter is negligible with respect to the former. The disorder 
dependent free-energy is then a sum of random numbers, each one being the disorder 
dependent free-energy of the bulk of each subsystem. In the limit of a very large number 
of subsystems ($V \gg v$) the central limit theorem implies that the total free-energy is 
Gaussian distributed with the maximum reached at a value $F_J^{\rm typ}$ that coincides with the 
average over all realizations of the randomness, $[F_J]_{\rm av}$. Thus, the typical $F_J$ is identical to the 
averaged $[F_J]_{\rm av}$. One can then compute the latter to understand the static properties of 
the typical system. 

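Once one has $[F_J]_{\rm av}$, one derives all disorder-averaged thermal averages by taking 
derivatives of the disorder-averaged free-energy with respect to sources introduced in the 
partition function. For example, 

$$ [\langle s_i \rangle]_{\rm av} = - \left. \frac{\partial [F_J]_{\rm av}}{\partial h_i} \right|_{h_i = 0} , \tag{5.5} $$
$$ [\langle s_i s_j \rangle - \langle s_i \rangle \langle s_j \rangle]_{\rm av} = - T \left. \frac{\partial^2 [F_J]_{\rm av}}{\partial h_i \partial h_j} \right|_{h_i = 0} , \tag{5.6} $$

with $E \to E - \sum_i h_i s_i$. 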

5.3 Models 

In the early 70s Edwards and Anderson proposed a rather simple model that should capture 
the main features of spin-glasses. The interactions (5.1) decay with a cubic power of the 
distance and hence they are relatively short-ranged. This suggests placing the spins on 
a regular cubic lattice and trading the randomness in the positions for random 
nearest neighbor exchanges taken from a Gaussian (or bimodal) probability distribution: 

$$ E_{\rm EA} = - \sum_{\langle ij \rangle} J_{ij} s_i s_j \quad \text{with} \quad P(J_{ij}) = (2\pi\sigma^2)^{-1/2} \, e^{-J_{ij}^2/(2\sigma^2)} . \tag{5.7} $$

A natural extension of the EA model in which all spins interact has been proposed by 
Sherrington and Kirkpatrick 

$$ E = - \sum_{i \neq j} J_{ij} s_i s_j \tag{5.8} $$






and it is called the SK model. The interaction strengths are taken from a Gaussian 
pdf and they scale with $N$ in such a way that the thermodynamic limit is non-trivial: 

$$ P(J_{ij}) = (2\pi\sigma_N^2)^{-1/2} \, e^{-J_{ij}^2/(2\sigma_N^2)} , \qquad \sigma_N^2 = \sigma^2 / N . \tag{5.9} $$

This is a case for which a mean-field approximation is expected to be exact. 
A further extension of the EA model is called the $p$ spin model 

$$ E = - \sum_{i_1 \neq \dots \neq i_p} J_{i_1 \dots i_p} s_{i_1} \cdots s_{i_p} - \sum_i h_i s_i \tag{5.10} $$

with $p \geq 3$. The exchanges are now taken from a Gaussian probability distribution with 

$$ [J_{i_1 \dots i_p}]_{\rm av} = 0 , \qquad [J^2_{i_1 \dots i_p}]_{\rm av} = \frac{J^2}{2 N^{p-1}} . \tag{5.11} $$

Indeed, an interesting thermodynamic limit is achieved by scaling $J_{i_1 \dots i_p}$ with $N^{-(p-1)/2}$. 
This scaling is justified as follows. The "force" $F_i \equiv \sum_{i_2 \neq \dots \neq i_p} J_{i i_2 \dots i_p} m_{i_2} \cdots m_{i_p}$ should be of 
order one. At low temperatures the $m_i$'s take plus and minus signs. In particular, we can 
estimate the order of magnitude of this term working at $T = 0$ and taking $m_i = \pm 1$ with 
probability $1/2$. In order to make the discussion simple, let us take $p = 2$. In this case, 
if the strengths $J_{ij}$ are of order one, $F_i$ is a sum of $N$ independent, identically distributed 
random variables, with zero mean and variance equal to 1. Thus, $F_i$ has zero mean and 
variance equal to $N$, and one can argue that $F_i$ is of order $\sqrt{N}$. To make it finite we 
then choose $J_{ij}$ to be of order $1/\sqrt{N}$ or, in other words, we impose $[J^2_{ij}]_{\rm av} = J^2/(2N)$. The 
generalization to $p > 2$ is straightforward. 
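
The scaling argument for $p = 2$ can be illustrated numerically. The sketch below is a rough check under simplifying assumptions: the couplings are Gaussian with a standard deviation passed as a parameter, and the unit-variance normalization $1/\sqrt{N}$ is used instead of the $J^2/(2N)$ convention of the text. It samples the local field $F = \sum_j J_j m_j$ with $m_j = \pm 1$ and shows that its spread grows as $\sqrt{N}$ for couplings of order one and stays of order one for couplings of order $1/\sqrt{N}$.

```python
import math
import random
import statistics


def local_field_samples(N, coupling_std, n_samples, rng):
    """Samples of F = sum_j J_j m_j, with Gaussian J_j and m_j = +1 or -1."""
    samples = []
    for _ in range(n_samples):
        F = sum(rng.gauss(0.0, coupling_std) * rng.choice((-1, 1))
                for _ in range(N))
        samples.append(F)
    return samples


if __name__ == "__main__":
    rng = random.Random(0)
    for N in (100, 400, 900):
        unscaled = statistics.pstdev(local_field_samples(N, 1.0, 300, rng))
        scaled = statistics.pstdev(
            local_field_samples(N, 1.0 / math.sqrt(N), 300, rng))
        print(N, unscaled, math.sqrt(N), scaled)
```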

Cases that find an application in computer science are defined on random graphs with 
fixed or fluctuating finite connectivity. In the latter case one places the spins on the 
vertices of a graph with links between couples or groups of p spins chosen with a probability 
c. These are called dilute spin-glasses. 

Let us now discuss some a priori simpler cases. An example is the Mattis random 
magnet, in which the interaction strengths are given by 

$$ J_{i_1 \dots i_p} = \xi_{i_1} \cdots \xi_{i_p} \quad \text{with } \xi_i = \pm 1 \text{ with probability } 1/2 . \tag{5.12} $$

In this case a simple gauge transformation, $\eta_i \equiv \xi_i s_i$, allows one to transform the 
disordered model into a ferromagnet, showing that there was no true frustration in the system. 
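
The gauge transformation is easily checked on a small fully connected example with $p = 2$. In the sketch below (names and system size are illustrative) the Mattis couplings are $J_{ij} = \xi_i \xi_j$ and the energy is $E = -\sum_{i<j} J_{ij} s_i s_j$; the energy of any configuration $\{s_i\}$ in the Mattis model coincides with the energy of $\{\eta_i = \xi_i s_i\}$ in the pure ferromagnet.

```python
import random


def energy(J, s):
    """E = -sum_{i<j} J[i][j] s_i s_j for a fully connected two-spin model."""
    N = len(s)
    return -sum(J[i][j] * s[i] * s[j]
                for i in range(N) for j in range(i + 1, N))


def gauge_transform(xi, s):
    """eta_i = xi_i s_i."""
    return [x * y for x, y in zip(xi, s)]


rng = random.Random(0)
N = 8
xi = [rng.choice((-1, 1)) for _ in range(N)]
J_mattis = [[xi[i] * xi[j] for j in range(N)] for i in range(N)]
J_ferro = [[1] * N for _ in range(N)]

s = [rng.choice((-1, 1)) for _ in range(N)]
print(energy(J_mattis, s), energy(J_ferro, gauge_transform(xi, s)))

# The configuration s = xi maps onto the fully aligned ferromagnetic state
print(energy(J_mattis, xi), -N * (N - 1) // 2)
```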

Random ferromagnets are systems in which the strengths of the interactions are not 
all identical but their sign is. Thus, there is no frustration in these systems either. 

Link randomness is not the only type of disorder encountered experimentally. Random 
fields, that couple linearly to the magnetic moments, are also quite common; the classical 
model is the ferromagnetic random field Ising model: 

$$ E_{\rm RFIM} = - J \sum_{\langle ij \rangle} s_i s_j - \sum_i s_i h_i \quad \text{with} \quad P(h_i) = (2\pi\sigma^2)^{-1/2} \, e^{-h_i^2/(2\sigma^2)} . \tag{5.13} $$

Models with site or link dilution are also interesting: 

$$ E_{\rm site \ dil} = - J \sum_{\langle ij \rangle} s_i s_j \epsilon_i \epsilon_j , \qquad E_{\rm link \ dil} = - J \sum_{\langle ij \rangle} s_i s_j \epsilon_{ij} , \tag{5.14} $$






with $P(\epsilon_i = 0, 1) = p, 1 - p$ in the first case and $P(\epsilon_{ij} = 0, 1) = p, 1 - p$ in the second. 
The dilute antiferromagnet in a uniform magnetic field is believed to behave similarly to 
the ferromagnetic random field Ising model. Experimental realizations of the former are 
common and measurements have been performed in samples like Rb$_2$Co$_{0.7}$Mg$_{0.3}$F$_4$. 

Once again, disorder is not only present in magnetic systems. An example that has 
received much attention is the case of a $d$ dimensional directed manifold moving in an 
embedding $N + d$ dimensional space under the effect of a quenched random potential. The 
simplest case with $d = 0$ corresponds to a particle moving in an embedding space with 
$N + d$ dimensions. If, for instance, $N = 1$, the particle moves on a line, if $N = 2$ it moves 
on a plane and so on and so forth. If $d = 1$ one has a line that can represent a polymer, a 
vortex line, etc. The fact that the line is directed means that it does not have overhangs. 
If the line moves in a plane, the embedding space has $(N = 1) + (d = 1)$ dimensions. The 
quenched random potential is, say, $V(\vec x)$, with $\vec x$ the coordinate on the embedding space 
and one characterises it with its probability distribution function $P(V)$. 

5.4 The Griffiths phase 

The effects of quenched disorder show up already in the paramagnetic phase of finite 
dimensional models, such as the ones introduced above. Below the critical point of the pure 
system (no disorder) one sees finite regions of the system that order due to fluctuations in 
the couplings that can make certain parts of the sample more ferromagnetic than others. 
These properties manifest themselves in non-analyticities of the free-energy that appear in a 
full interval of temperatures above the critical temperature of the disordered model, as shown 
by Griffiths. We shall not discuss these features in detail here. 

5.5 The spin-glass transition 

Let us now discuss a problem in which disorder is so strong as to modify the nature of the 
low temperature phase. If this is so, one needs to define a new order parameter, capable 
of identifying order in this phase. 

5.5.1 The order parameter 

The spin-glass equilibrium phase is one in which spins "freeze" in randomly-looking 
configurations. In finite dimensions these configurations are spatially irregular. A snapshot 
looks statistically identical to a high temperature paramagnetic configuration in which spins 
point in both directions. However, while at high temperatures the spins flip rapidly and 
another snapshot taken immediately after would look completely different from the 
previous one, at low temperatures two snapshots taken at close times are highly correlated. 

In a spin-glass state the local magnetization is expected to take a non-zero value, 
$m_i = \langle s_i \rangle \neq 0$, where the average is interpreted in the restricted sense discussed above, 
i.e. within a pure state. Instead, the global magnetization density, $m = N^{-1} \sum_{i=1}^{N} m_i$, 
vanishes since one expects to have as many spins pointing up as spins pointing down 
with each possible value of $|m_i|$. Thus, the total magnetization, $m = N^{-1} \sum_i m_i$, of a 
spin-glass vanishes at all temperatures and it is not a good order parameter. 

The spin-glass transition is characterized by a finite peak in the linear magnetic 
susceptibility and a diverging non-linear magnetic susceptibility. Let us discuss the former 
first and show how it yields evidence for the freezing of the local magnetic moments. For 
a generic magnetic model such that the magnetic field couples linearly to the Ising spin, 
$E \to E - \sum_i h_i s_i$, the linear susceptibility is related, via the static fluctuation-dissipation 
theorem, to the fluctuations of the magnetization: 

$$ \chi_{ij} \equiv \left. \frac{\partial \langle s_i \rangle_h}{\partial h_j} \right|_{h=0} = \beta \, \langle (s_i - \langle s_i \rangle)(s_j - \langle s_j \rangle) \rangle . \tag{5.15} $$

This relation is proven by using the definition of $\langle s_i \rangle$ and simply computing the derivative 
with respect to $h_j$. In particular, 

$$ \chi_{ii} = \beta (1 - m_i^2) \geq 0 , \tag{5.16} $$

with $m_i = \langle s_i \rangle$. The total susceptibility measured experimentally is $\chi \equiv N^{-1} \sum_{ij} \chi_{ij}$. On 
the experimental side we do not expect to see order one sample-to-sample fluctuations in 
this global quantity. On the analytical side one can use a similar argument to the one 
presented in Sect. 5.2.2 to argue that $\chi$ should be self-averaging. Thus, the experimentally 
observed susceptibility should be given by 

$$ [\chi]_{\rm av} = N^{-1} \sum_{ij} [\chi_{ij}]_{\rm av} \approx N^{-1} \sum_i [\chi_{ii}]_{\rm av} = \beta N^{-1} \sum_i \left( 1 - [m_i^2]_{\rm av} \right) , \tag{5.17} $$

since we can expect that cross-terms cancel under the disorder average. The fall of $\chi$ at low 
temperatures with respect to its value at $T_c$ signals the freezing of the local magnetizations, 
$m_i$, in non-zero values. Note that this argument is based on the assumption that the 
measurement is done in equilibrium. 
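
The static fluctuation-dissipation relation (5.15) and its diagonal form (5.16) can be verified by exact enumeration on a very small system. The sketch below is an illustration only: it assumes $N = 5$ Ising spins with Gaussian couplings, and a small uniform field is kept on so that $\langle s_i \rangle \neq 0$ in this finite system (the relation between the derivative and the connected correlation holds at any field, not only at $h = 0$). It compares a numerical derivative of $\langle s_i \rangle$ with respect to $h_j$ to $\beta$ times the connected correlation.

```python
import itertools
import math
import random


def thermal_averages(J, h, beta):
    """Exact <s_i> and <s_i s_j> for E = -sum_{i<j} J_ij s_i s_j - sum_i h_i s_i."""
    N = len(h)
    Z = 0.0
    m = [0.0] * N
    c = [[0.0] * N for _ in range(N)]
    for s in itertools.product((-1, 1), repeat=N):
        E = -sum(J[i][j] * s[i] * s[j]
                 for i in range(N) for j in range(i + 1, N))
        E -= sum(h[i] * s[i] for i in range(N))
        w = math.exp(-beta * E)
        Z += w
        for i in range(N):
            m[i] += w * s[i]
            for j in range(N):
                c[i][j] += w * s[i] * s[j]
    return [x / Z for x in m], [[x / Z for x in row] for row in c]


def chi_numerical(J, h, beta, i, j, eps=1e-5):
    """Central finite difference of <s_i> with respect to h_j."""
    h_up = list(h)
    h_up[j] += eps
    h_down = list(h)
    h_down[j] -= eps
    m_up, _ = thermal_averages(J, h_up, beta)
    m_down, _ = thermal_averages(J, h_down, beta)
    return (m_up[i] - m_down[i]) / (2.0 * eps)


def chi_fluctuation(J, h, beta, i, j):
    """beta (<s_i s_j> - <s_i><s_j>), the fluctuation side of the relation."""
    m, c = thermal_averages(J, h, beta)
    return beta * (c[i][j] - m[i] * m[j])


rng = random.Random(0)
N, beta = 5, 0.7
J = [[rng.gauss(0.0, 1.0) for _ in range(N)] for _ in range(N)]
h = [0.3] * N

print(chi_numerical(J, h, beta, 0, 3), chi_fluctuation(J, h, beta, 0, 3))
m, _ = thermal_averages(J, h, beta)
print(chi_numerical(J, h, beta, 2, 2), beta * (1.0 - m[2] ** 2))
```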

Thus, the natural global order parameter that characterizes the spin-glass transition is 

$$ q_{\rm EA} \equiv N^{-1} \sum_i [m_i^2]_{\rm av} \tag{5.18} $$

as proposed in the seminal Edwards-Anderson paper. $q_{\rm EA}$ vanishes in the high temperature 
phase since all $m_i$ are zero but it does not in the low temperature phase since the 
square power takes care of the different signs. Averaging over disorder eliminates the site 
dependence. Thus, $q_{\rm EA}$ is also given by 

$$ q_{\rm EA} = [m_i^2]_{\rm av} . \tag{5.19} $$

In all this discussion the statistical average has to be interpreted in the same restricted 
sense as the one discussed in the paramagnetic-ferromagnetic transition of the usual 
Ising model (see Sect. 4). Whether there are two equilibrium states (as in the usual 
ferromagnetic phase) or many more (as we shall see appear in fully-connected spin-glass 
models) the average is taken in one of them. Since in the latter case it is found that even 
if the individual $m_i$ take different values in different states, $q_{\rm EA}$ is 'state-independent', this 
definition does not allow one to distinguish between the two scenarios. 

We shall see below a more pertinent definition of an order parameter that allows one to 
distinguish between the simple, two-state, and the many state scenarios. In practice it is 
related to the definition of the overlap - or correlation - between two spin configurations, 
say $\{s_i\}$ and $\{\sigma_i\}$, defined as 

$$ q_{s\sigma} = N^{-1} \sum_i s_i \sigma_i . \tag{5.20} $$

$q_{s\sigma}$ takes values between $-1$ and $1$. It equals one if $\{s_i\}$ and $\{\sigma_i\}$ differ in a number 
of spins that is smaller than $O(N)$, it equals $-1$ when the two configurations are totally 
anticorrelated - with the same proviso concerning a number of spins that is not $O(N)$ - and 
it equals zero when $\{s_i\}$ and $\{\sigma_i\}$ are completely uncorrelated. Note that the self-overlap 
of a configuration with itself is identically one for Ising spins. 
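
The three limiting values of the overlap are straightforward to illustrate. A minimal sketch, with randomly generated Ising configurations and a system size chosen arbitrarily:

```python
import random


def overlap(s, sigma):
    """q = N^{-1} sum_i s_i sigma_i between two Ising configurations."""
    if len(s) != len(sigma):
        raise ValueError("configurations must have the same number of spins")
    return sum(a * b for a, b in zip(s, sigma)) / len(s)


rng = random.Random(0)
N = 10000
s = [rng.choice((-1, 1)) for _ in range(N)]
sigma = [rng.choice((-1, 1)) for _ in range(N)]
flipped = [-x for x in s]

print("self-overlap:", overlap(s, s))
print("overlap with the reversed configuration:", overlap(s, flipped))
print("overlap of two independent configurations:", overlap(s, sigma))
```

For two independent random configurations the overlap is not exactly zero at finite $N$; it fluctuates with a typical size $N^{-1/2}$.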



5.5.2 Divergent susceptibility 



In a pure magnetic system with a second-order phase transition, see the discussion in 
Sect. 4, the susceptibility of the order parameter to a field that couples linearly to it 
diverges at the transition. Now, via the static fluctuation-dissipation theorem, the 
variation of an observable with respect to the field that couples linearly to it is equal to 
the correlation of the observable with itself: 

$$ \left. \frac{\partial \langle O_i \rangle}{\partial h_j} \right|_{h=0} = \beta \, \langle (O_i - \langle O_i \rangle)(O_j - \langle O_j \rangle) \rangle . \tag{5.21} $$



Having identified the spin-glass order parameter $q = [m_i^2]_{\rm av}$ that is linearly related to 
$[\chi_{ii}]_{\rm av}$ one can define the spin-glass susceptibility as 

$$ \chi_{SG} \equiv \frac{1}{N} \sum_{ij} [\chi_{ij}^2]_{\rm av} = \frac{\beta^2}{N} \sum_{ij} \left[ \left( \langle s_i s_j \rangle - \langle s_i \rangle \langle s_j \rangle \right)^2 \right]_{\rm av} \tag{5.22} $$

and one can check whether it diverges at $T_c$ as expected in a second-order phase transition. 
One can show that the divergence of $\chi_{SG}$ is related to the divergence of the non-linear 
usual magnetic susceptibility 

$$ \chi^{(3)} \equiv \left. \frac{\partial^3 m_h}{\partial h^3} \right|_{h=0} \tag{5.23} $$

that can be accessed experimentally. A careful experimental measurement of $\chi^{(3)}$ 
and of higher-order non-linear susceptibilities demonstrated that all these susceptibilities 
diverge at $T_c$. 



5.6 Statics of models with strong disorder 

Let us discuss in this part the statics of spin-glass models and the two main competing 
descriptions of it. Similar arguments can be used to study other models with strong 
disorder, such as a manifold in a random potential. 



5.6.1 Scaling arguments and the droplet theory 

The droplet theory is a phenomenological model that assumes that the low temperature 
phase of a spin-glass model has only two equilibrium states related by an overall spin flip. 
It is then rather similar to a ferromagnet, only that the nature of the order in the two 
equilibrium states is not easy to see: it is not just most spins pointing up or most spins 
pointing down with some thermal fluctuations within. At a glance, one sees a disordered 
paramagnetic-like configuration and a more elaborate order parameter has to be measured 
to observe the order. The spin-glass phase is then called a disguised ferromagnet and a 
usual spontaneous symmetry breaking (between the two equilibrium states related by 
spin-reversal symmetry) leading to usual ergodicity breaking is supposed to take place at $T_c$. 

Once this assumption has been made, renormalization group arguments are used to 
describe the scaling behavior of several thermodynamic quantities. The results found are 
then quantitatively different from the ones for a ferromagnet but no novelties appear. 

Let us now just discuss one simple argument that is at the basis of what is needed 
to derive the results of the droplet theory without entering into the complications of the 
calculations. 

The Imry-Ma argument 

Take a ferromagnetic Ising model in a random field, defined in eq. (5.13). In zero 
applied field and low enough temperature, if $d > 1$ there is a phase transition between a 
paramagnetic and a ferromagnetic phase. If one considers the effect of a random field 
with very strong typical strength, the spins will align with the local external fields 
and the system will be paramagnetic. It is, however, non-trivial to determine 
the effect of a relatively weak random field on the ferromagnetic phase at sufficiently low 
temperature. The long-range ferromagnetic order could be preserved or else the field could 
be enough to break up the system into large but finite domains of the two ferromagnetic 
phases. 

A qualitative argument to estimate whether the ferromagnetic phase survives or not 
in presence of the external random field is called the Imry-Ma argument in this context 
and it is a simple extension of the discussion in Sect. 4.3. Let us fix $T = 0$ and switch on 
a random field. If a domain $\mathcal{V}$ of the opposite order (say down) is created within the 
bulk of the ordered state (say up) the system pays an energy due to the unsatisfied links 
lying on the boundary that is 

$$ \Delta E_{\rm border} \sim 2 J L^{d-1} \tag{5.24} $$

where $L$ is the linear length of the border and $d - 1$ is the dimension of the border of 
a domain embedded in a $d$ dimensional volume, assuming it is compact. Typically the 
magnetic energy in the interior of the domain due to the external field is 

$$ \Delta E_{\rm rf} \sim - h L^{d/2} \tag{5.25} $$

since there are $N = L^d$ spins inside the domain of linear length $L$ and, using the central 
limit theorem, $-h \sum_{j \in \mathcal{V}} s_j \sim -h \sqrt{N} = -h L^{d/2}$. The comparison between these two 
energy scales yields 

$$ J L^{d-1} \sim h L^{d/2} \quad \Rightarrow \quad L \sim \left( \frac{J}{h} \right)^{\frac{2}{2-d}} . \tag{5.26} $$

This indicates that the creation of domains at zero temperature is favorable in $d < 2$ and 
it is not favorable in $d > 2$. (The marginal case $d = 2$ is more subtle and we do not 
discuss it here.) The size of the domain is given by the expression above. 
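
The energy balance of the Imry-Ma argument can be tabulated in a few lines. The sketch below simply adds the domain wall cost $2 J L^{d-1}$ and the typical random-field gain $-h L^{d/2}$; the values $J = 1$ and $h = 0.1$ are arbitrary illustrative choices, and the factor 2 of the wall energy is kept, so that the crossover length reads $(2J/h)^{2/(2-d)}$.

```python
def domain_energy(L, d, J=1.0, h=0.1):
    """Imry-Ma estimate: domain wall cost plus typical random-field gain."""
    return 2.0 * J * L ** (d - 1) - h * L ** (d / 2.0)


def crossover_length(d, J=1.0, h=0.1):
    """Length at which wall cost and field gain balance (meaningful for d < 2)."""
    if d >= 2:
        raise ValueError("the field term does not win at large L for d >= 2")
    return (2.0 * J / h) ** (2.0 / (2.0 - d))


if __name__ == "__main__":
    print("d = 1 crossover length:", crossover_length(1))
    for L in (10, 100, 1000, 10000):
        print(L, domain_energy(L, 1), domain_energy(L, 3))
```

In $d = 1$ the balance turns negative beyond the crossover length, so large domains are favorable, while in $d = 3$ it stays positive for all the sizes listed.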

A very similar reasoning is used to argue that there cannot be spin-glass order in an
Edwards-Anderson model in an external field. The only difference is that the domain wall
energy is here assumed to be proportional to $L^y$ with an a priori unknown d-dependent
exponent y that is related to the geometry of the domains.

5.6.2 The naive mean-field approximation 

Disordered models have quenched random interactions. Due to the fluctuating values of 
the exchanges, one expects that the equilibrium configurations be such that the spins 
freeze in different directions. The local averaged magnetizations need not be identical. At
low temperatures, and quite surprisingly a priori, the naive mean-field equations (4.20)
have a number of minima that is exponential in N. These can be identified as different
equilibrium states that could be accessed by applying the corresponding site-dependent
pinning fields.

Thouless, Anderson and Palmer [16, 17] showed that these equations are not correct in
the fully-connected disordered case, since a term called the Onsager reaction term is
missing. Nevertheless, the fact that the number of equilibrium states diverges as an
exponential of N in the thermodynamic limit remains true for these models. We shall
not derive the correct TAP equations here but we shall describe the main features they
predict.

The free-energy density as a function of the local magnetizations $m_i$ defines what is
usually called the free-energy landscape. Note that this function depends on $N \gg 1$
variables, $m_i$, that are not all identical in the disordered case in which the interactions
between different groups of spins are different.

For all models, at high temperatures $f(m_i)$ is characterized by a single minimum in
which all local magnetizations vanish, as expected; this is the paramagnetic state.

At very low temperature there are many equilibrium states (and not just two as in an 
Ising ferromagnetic model) and they are not related by a symmetry (as spin reversal in 
the Ising ferromagnet or a rotational symmetry in the Heisenberg ferromagnet). These 
are characterized by non-zero values of the local magnetizations that are different in 
different states. 

One can also access the saddle-points of the free-energy landscape by looking for
stationary points of $f(m_i)$ that are not the absolute minima or have unstable directions. The
number and structure of saddle-points is particularly interesting in the $p \geq 3$ cases and
it is indeed the reason why these models, with a random first order transition, have been
proposed to mimic the structural glass arrest.

The derivation and understanding of these results is quite subtle and goes beyond the
scope of these Lectures. Still, we shall briefly present the phenomenology of these models
here, just to give a flavor of their complexity. Below a temperature $T_c$, an exponential
(in N) number of metastable states contribute to the thermodynamics in such a non-trivial
way that their combined contribution to the observables makes them those of a
paramagnet. Even if each of these states is non-trivial (the $m_i$'s are different from zero)
the statistical average over all of them yields results that are identical to those of a
paramagnet. At a lower temperature $T_s$ ($T_s < T_c$) there is an entropy crisis: fewer than an
exponential number of metastable states survive, and there is a static phase transition to
a glassy state.



5.6.3 The replica method 

A picture that is consistent with the one arising from the naive mean-field approximation 
but contradicts the initial assumption of the droplet model arises from the exact solution 
of fully-connected spin-glass models. These results are obtained using a method which is 
called the replica trick and that we shall briefly present below. 

In Sect. 5.2.2 we argued that the typical properties of a disordered system can be 
computed from the disorder averaged free-energy 

$$[F_J]_{\rm av} = \int dJ \, P(J) \, F_J . \qquad (5.27)$$

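The replica method allows one to compute $[F_J]_{\rm av}$ for fully-connected models. It is based
on the smart use of the identity

$$\ln Z = \lim_{n \to 0} \frac{Z^n - 1}{n} , \qquad (5.28)$$

which implies

$$-\beta [F_J]_{\rm av} = \int dJ \, P(J) \ln Z_J = \lim_{n \to 0} \frac{1}{n} \left( \int dJ \, P(J) \, Z_J^n - 1 \right) , \qquad (5.29)$$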

where we have exchanged the limit $n \to 0$ with the integration over the exchanges. For
integer n the replicated partition function, $Z_J^n$, reads

$$Z_J^n = \sum_{\{s_i^a\}} e^{-\beta \left[ E_J(\{s_i^1\}) + \dots + E_J(\{s_i^n\}) \right]} . \qquad (5.30)$$

Here $\sum_{\{s_i^a\}} \equiv \sum_{\{s_i^1 = \pm 1\}} \cdots \sum_{\{s_i^n = \pm 1\}}$. $Z_J^n$ corresponds to n identical copies of the original
system, that is to say, all of them with the same realization of the disorder. Each copy
is characterized by an ensemble of N spins, $\{s_i^a\}$, that we label with a replica index
a = 1, . . . , n.
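
The identity (5.28) and its disorder-averaged version (5.29) can be checked numerically at small but finite n. The sketch below uses a toy model that is purely an assumption made for illustration (two Ising spins coupled by a single random exchange J); the function names are hypothetical. It compares the direct average of ln Z_J with the replica expression, and with the n = 1 value, which corresponds to averaging Z_J itself.

```python
import math
import random


def partition_function(J, beta=1.0):
    """Z_J for two Ising spins coupled by J: sum over s1, s2 = +-1 of exp(beta J s1 s2)."""
    return sum(math.exp(beta * J * s1 * s2) for s1 in (-1, 1) for s2 in (-1, 1))


def quenched_log_z(couplings):
    """Direct disorder average of ln Z_J over the sampled couplings."""
    return sum(math.log(partition_function(J)) for J in couplings) / len(couplings)


def replica_log_z(couplings, n):
    """Replica estimate ([Z_J^n]_av - 1) / n at finite n, cf. eq. (5.29)."""
    mean_zn = sum(partition_function(J) ** n for J in couplings) / len(couplings)
    return (mean_zn - 1.0) / n


if __name__ == "__main__":
    rng = random.Random(0)
    couplings = [rng.gauss(0.0, 1.0) for _ in range(2000)]  # Gaussian exchanges (assumed)
    print("direct average of ln Z:", quenched_log_z(couplings))
    for n in (1.0, 0.1, 0.01, 0.001):
        print(f"n={n:<6} replica estimate: {replica_log_z(couplings, n):.6f}")
```

As n decreases the replica estimate approaches the direct average of ln Z_J, while at n = 1 it is systematically larger.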

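With this method one can study p-spin disordered spin models, for which $Z_J^n$ takes the
form

$$Z_J^n = \sum_{\{s_i^a\}} e^{\beta \sum_a \left[ \sum_{i_1 \neq \dots \neq i_p} J_{i_1 \dots i_p} s_{i_1}^a \cdots s_{i_p}^a + \sum_i h_i s_i^a \right]} . \qquad (5.31)$$

The average over disorder amounts to computing a Gaussian integral for each set of spin
indices $i_1, \dots, i_p$. One finds

$$[Z_J^n]_{\rm av} = \sum_{\{s_i^a\}} e^{\frac{\beta^2 J^2}{2 N^{p-1}} \sum_{i_1 \neq \dots \neq i_p} \left( \sum_a s_{i_1}^a \cdots s_{i_p}^a \right)^2 + \beta \sum_a \sum_i h_i s_i^a} \equiv \sum_{\{s_i^a\}} e^{-\beta F(\{s_i^a\})} . \qquad (5.32)$$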

The function $\beta F(\{s_i^a\})$ is not random. It depends on the spin variables only but it includes
terms that couple different replica indices:

$$\beta F(\{s_i^a\}) \approx -\frac{N \beta^2 J^2}{2} \left[ \sum_{a \neq b} \left( \frac{1}{N} \sum_i s_i^a s_i^b \right)^p + n \right] - \beta \sum_a \sum_i h_i s_i^a . \qquad (5.33)$$

In writing the last expression we have dropped terms that are subleading in N (in complete
analogy with what we have done for the pure p-spin ferromagnet). The constant term
$-N n \beta^2 J^2 / 2$ originates in the terms with $a = b$, for which $(s_i^a)^2 = 1$.

To summarize, we started with an interacting spin model. Next, we enlarged the number
of variables from N spins to $N \times n$ replicated spins by introducing n non-interacting
copies of the system. By integrating out the disorder we decoupled the sites but we paid
the price of coupling the replicas. Hitherto the replica indices act as a formal tool
introduced to compute the average over the bond distribution. Nothing distinguishes one
replica from another and, in consequence, the "free-energy" $F(\{s_i^a\})$ is invariant under
permutations of the replica indices.

The next step to follow is to identify the order parameters and transform the free- 
energy into an order-parameter dependent expression to be rendered extremal at their 
equilibrium values. In a spin-glass problem we already know that the order parameter is 
not the global magnetization as in a pure magnetic system but the parameter q - or more 
generally the overlap between states. Within the replica calculation an overlap between 
replicas 

$$q_{ab} = N^{-1} \sum_i s_i^a s_i^b \qquad (5.34)$$

naturally appeared in eq. (5.33). It can be shown that this expression plays the role of an
overlap between states labeled, say, by $\alpha$ and $\beta$:

$$q_{\alpha\beta} = N^{-1} \sum_i \langle s_i \rangle_\alpha \langle s_i \rangle_\beta . \qquad (5.35)$$

The statistics of possible overlaps is then characterized by a probability function

$$P(q) = \sum_{\alpha\beta} w_\alpha w_\beta \, \delta(q - q_{\alpha\beta}) , \qquad (5.36)$$

with $w_\alpha$ the statistical weight of state $\alpha$. This function extends the concept of the
order parameter and allows one to distinguish between the droplet picture prediction that
corresponds to

$$P(q) = \frac{1}{2} \left[ \delta(q - q_{\rm EA}) + \delta(q + q_{\rm EA}) \right] \qquad (5.37)$$

and a more complicated situation in which P(q) has the two delta functions at $\pm q_{\rm EA}$ plus
a non-zero value on a continuous support in between.
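
To make the overlap of eq. (5.34) concrete, here is a minimal sketch that evaluates it for explicit spin configurations. The function name `overlap` and the randomly drawn configurations are assumptions for illustration only; they are not equilibrium states of any model.

```python
import random


def overlap(config_a, config_b):
    """q_ab = (1/N) sum_i s_i^a s_i^b for two spin configurations of equal length."""
    if len(config_a) != len(config_b):
        raise ValueError("configurations must have the same number of spins")
    return sum(sa * sb for sa, sb in zip(config_a, config_b)) / len(config_a)


if __name__ == "__main__":
    rng = random.Random(0)
    N = 1000
    a = [rng.choice((-1, 1)) for _ in range(N)]
    b = [rng.choice((-1, 1)) for _ in range(N)]  # drawn independently of a
    reversed_a = [-s for s in a]                 # global spin reversal of a
    print("q(a, a)          =", overlap(a, a))
    print("q(a, reversed a) =", overlap(a, reversed_a))
    print("q(a, b)          =", overlap(a, b))
```

A configuration has overlap 1 with itself and -1 with its spin-reversed image, while two independent random configurations have an overlap of order $N^{-1/2}$.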

The idea is to write a free-energy density as a function of the order parameter $q_{ab}$ and
then look for the extremum that minimizes it, in complete analogy with what has been
done for the fully-connected ferromagnet. This is, of course, a tricky business, since the
order parameter is here a matrix whose dimension n goes to zero! A recipe for
identifying the form of the order parameter (or the correct saddle-point solution) was
proposed by G. Parisi in the late 70s and early 80s. This solution has more recently been
proven to be exact for mean-field models by two mathematical physicists, Guerra and Talagrand.
Whether the very rich physical structure that derives from this rather formal solution
survives in finite dimensional systems remains a subject of debate.

Symmetry and ergodicity breaking 

Let us summarize some of the main features of the solution to the SK model obtained 
with replicas. There is a phase transition at a finite T c at which the replica symmetry is 
broken. This is a rather abstract symmetry. This symmetry breaking is accompanied by
ergodicity breaking as in the usual case. 

Right below $T_c$ a number of equilibrium states that is exponential in N appears. The
transition is continuous: the order parameter approaches zero right below $T_c$. Lowering the
temperature further, each ergodic component breaks into many other ones. In this sense, the
full spin-glass phase, $T < T_c$, is 'critical' and not only the single point $T_c$.
6 Stochastic processes

Simultaneously with the development of equilibrium statistical mechanics, which assumes
that systems can be described in terms of Boltzmann-Gibbs probability distributions, the 
question of how this equilibrium is reached was raised (in fact by Boltzmann himself), 
and led to the well known debate about irreversibility and the arrow of time. Although 
the conceptual issues are formidable, one can take a phenomenological point of view and 
postulate the nature of the coupling between a given system and a thermal bath such 
that equilibrium is, at least in principle, reached at long times. This is known as the 
Langevin approach, which provides a consistent description of the dynamics of systems 
subject to thermal noise, such that the stationary solution reproduces the Boltzmann- 
Gibbs distribution. 

One can also be interested in situations such that the dynamics should lead at long times
to equilibrium, but the time needed to do so is either infinite or very large compared
to experimental time scales.² These systems are usually referred to as 'glassy'. In these
cases, equilibrium concepts are a priori useless; the description of the system is inherently
of dynamical nature. The theoretical framework available to describe the dynamics of a
system subject to thermal noise is good for both cases.

Almost any physical system is subject to fluctuations that have an unknown origin
and/or can only be characterized statistically. This noise is one of the manifestations
of the exchange of energy between the system and its environment; the other
accompanying feature is, as we will see in more detail below, dissipation. The time evolution
of a system coupled to its environment can be described in two equivalent ways. One
is the Langevin approach that consists in studying Newton's equations with the addition 
of two terms representing friction and thermal noise. The other description is probabilistic 
and is concerned with the evolution of the probability distribution of the relevant degrees 
of freedom of the system. Both approaches turn out to be very useful to understand the 
dynamics of model systems. 

The environment is often assumed to be a heat bath in equilibrium at a given temper- 
ature. In these conditions, many systems do equilibrate with such an environment after 
a short transient. The dynamics of equilibrated systems has several special features that 
we also review in this Chapter. How these properties are modified in systems that never 
reach equilibrium is a problem of current interest in research but we shall not develop it 
here. 

6.1 The Langevin equation 

The Langevin equation is a stochastic differential equation that describes phenomenolog- 
ically a large variety of problems. It describes the time evolution of a set of slow variables 
coupled to a much larger set of fast variables that are usually (but not necessarily) as- 
sumed to be in thermal equilibrium at a given temperature. We first introduce it in the 
context of Brownian motion and later derive it in more generality. 

² There might of course be systems that never reach thermal equilibrium, such as driven dissipative
systems for example.
6.1.1 Brownian motion

The Brownian motion is the erratic motion of a mesoscopic colloidal particle (in the 
micrometer range) immersed in an equilibrated fluid made up of much lighter molecules. 
The jerky motion of the colloidal particle is due to the collisions with the molecules in 
the liquid. Knowing that each individual collision deflects the trajectory of the particle 
by a tiny amount, Langevin proposed to collect the effect of all molecules in the fluid in 
a time-dependent random force. If the particle moves in d dimensions, has mass m, its
center of mass position at time t is represented by $\vec x = (x_1, \dots, x_d)$ and its velocity is
$\vec v = \dot{\vec x}$, Newton's equation reads

$$m \dot{\vec v}(t) = \vec F(\vec x, t) + \vec f(t) . \qquad (6.1)$$

The force $\vec F$ designates all external deterministic forces, and depends in general also
on the position of the particle $\vec x$, while $\vec f$ represents the net force exerted by all the
individual molecules in the fluid on the massive particle. The latter depends on the
time-dependent positions of all the molecules and is a rapidly fluctuating function of time. The
characteristic time for the variation of this force is related to the time interval between
successive collisions, that we call $\tau_c$, and can be estimated to be of the order of the
pico-second or even shorter for a typical liquid.³

Due to its rapid fluctuations, the time-dependence of the force $\vec f$ cannot be specified.
One can, instead, make reasonable assumptions about its average over a large number of
identical macroscopic situations and characterize it in statistical terms. More precisely,
one considers an ensemble of n systems made of one tracer particle moving in a fluid and
that are prepared in identical conditions, and defines ensemble-average quantities

$$\langle O(t) \rangle = \frac{1}{n} \sum_{k=1}^{n} O_{\vec f}^{(k)}(t) \qquad (6.2)$$

with the label k identifying the copy in the ensemble and O an observable that depends
on the force $\vec f$. (Equivalently, one can consider a single experiment in which one uses n
identical non-interacting tracer particles moving in the medium.) Equation (6.1) implies
that the position and velocity of the particle are both fluctuating quantities that depend
on $\vec f$. The aim is then to predict the average result (and the fluctuations) over a large
number of experiments performed in identical conditions, or equivalently over a large
number of particles provided these are non-interacting.

In the absence of external forces, the problem is fully isotropic and the ensemble-averaged
velocity can only vanish. If however the particle acquires a non-vanishing velocity
$\vec v$, then the environment will react and induce a non-zero force which, by symmetry, must
be in the direction of $\vec v$. Thus, one can propose the following decomposition:

$$\vec f = f_1(v) \, \vec v + \vec \xi \qquad (6.3)$$

where $f_1$ is some function of the modulus of $\vec v$, and $\vec \xi$ is an isotropic random force, the
thermal noise, that keeps the agitation of the particle (this name has its origin in the
random noise that one can actually hear in electric circuits). $\vec \xi$ has vanishing
ensemble-average at each instant t: $\langle \xi_i(t) \rangle = 0$, for all i and all times. The average over different
realizations of the history of the system introduced in eqn (6.2) corresponds now to an
average over histories of the time-dependent random force, $\vec \xi$. Thus, henceforth the angular
brackets represent:

$$\langle O(t) \rangle = \int_{-\infty}^{\infty} \prod_{j=0}^{\mathcal M} \prod_{i=1}^{d} d\xi_i(t_j) \, P[\xi_i(t_j)] \, O_{\vec \xi}(t) \equiv \int \mathcal D \vec \xi \, P(\vec \xi) \, O_{\vec \xi}(t) ,$$

where $P[\xi_i(t_j)]$ is the probability of occurrence of $\xi_i$ at time $t_j = j \delta$, with $j = 0, \dots, \mathcal M$
and $\delta$ an infinitesimal time-interval. In the second member above we wrote the measure
explicitly, and in the third member we used a short-hand notation that we adopt hereafter.
The subindex $\vec \xi$ in O indicates that it is evaluated on the solution to the Langevin equation
and hence depends on the thermal noise realization.

³ This does not mean, however, that the correlation function of this random force is short range in
time. Because of momentum conservation in the surrounding fluid, there appears a now well known 'long
time tail' in the force correlation function, only decaying as $t^{-3/2}$ in three dimensions.

The simplest assumption for $f_1(v)$ is that it tends to a constant for small velocities,
leading to the familiar friction force:

$$f_1(v \to 0) \, \vec v = -\gamma \vec v , \qquad \gamma > 0 , \qquad (6.4)$$

that opposes the motion of the particle. The friction coefficient, $\gamma$, is proportional to the
shear viscosity $\eta$ of the medium⁴, $\gamma = c \eta > 0$, with c a constant of geometric origin that
depends on the size and shape of the colloidal particle. When the medium is a normal
fluid, and the particle is a sphere of radius a that is much larger than the mean free-path
of the molecules, one has $c = 6 \pi a$ and one recovers the Stokes law for a spherical particle
in a viscous fluid. The friction coefficient $\gamma$ is of the order of $n_p \ell \tau_c T$ with $n_p$ the density
of particles in the fluid, $\ell$ the mean free path, $\tau_c$ the average time between collisions, and the
temperature T measured in units of the Boltzmann constant $k_B$. For a typical liquid,
like water in normal conditions, the viscosity is of the order of 0.01 Poise.⁵ With the
choice (6.4) eqn (6.1) becomes

$$m \dot{\vec v}(t) = -\gamma \vec v(t) + \vec F(\vec x, t) + \vec \xi(t) . \qquad (6.5)$$

So far we have characterized the random force exerted by the fluid by giving its average
value, the friction force. In order to characterize more completely the motion of the particle,
one also needs to know how the random force fluctuates in time. This information is (in
part) contained in its correlation function, defined by comparing the thermal noise at two
subsequent times $t'$ and $t$, with $t = t' + \tau$. Since collisions are very irregular, one can
assume that the forces at two different times are statistically independent for long enough
time separations (but see footnote 3), i.e.

$$\langle \xi_i(t) \xi_i(t') \rangle = \langle \xi_i(t) \rangle \langle \xi_i(t') \rangle = 0 \quad \text{if} \quad \tau \equiv t - t' \gg \tau_c , \quad \forall \, i = 1, \dots, d .$$

⁴ The shear viscosity of a system measures its resistance to flow. A flow field can be established by
placing the system between two plates and pulling them apart in opposite directions creating a shear
force. The rate at which the plates are pulled apart is called the shear rate. Other geometries are also
possible.

⁵ The friction coefficient and the viscosity are measured in units of [mass]/[time] and Poise =
[energy x time]/[volume] in the CGS system, respectively.
The correlation between the same component of the random force evaluated at different
times is, by definition, a symmetric function of times, $\langle \xi_i(t) \xi_i(t') \rangle = \langle \xi_i(t') \xi_i(t) \rangle$. In
addition, one assumes that the correlation between the same component of the force at
two different times is stationary, that is to say, that it only depends on the time-difference
$\tau = t - t'$. This is a property of the reservoir (the fluid in our case) in thermal equilibrium. Finally,
since we assume that all directions of space are equivalent, the components of the random
force in different directions are uncorrelated (even at time-differences that are shorter
than $\tau_c$):

$$\langle \xi_i(t) \xi_j(t') \rangle = \delta_{ij} \, g(|\tau|) , \qquad \forall \, i, j = 1, \dots, d . \qquad (6.6)$$

This comes from the fact that $\delta_{ij}$ is the only rotationally invariant tensor. In the above
equation, $g(0) > 0$ and $g(|\tau|)$ is sharply peaked around $\tau = 0$ with a support of
the order of $\tau_c$. At macroscopic time-scales, $\tau$ is actually much longer than $\tau_c$, and one
can approximate $g(|\tau|)$ by a delta function of weight 2A:

$$\langle \xi_i(t) \xi_j(t') \rangle = 2 A \, \delta_{ij} \, \delta(t - t') , \qquad \forall \, i, j = 1, \dots, d . \qquad (6.7)$$

The Fourier transform of the stationary correlation of the noise defines its spectral density,
$S(\omega)$. In this case, $S(\omega)$ is independent of the frequency $\omega$, defining a white noise.

Two-time correlations do not characterize time-dependent random variables completely. 
This is achieved either by giving all higher-order correlations or, equivalently, by specifying 
its full multivariate probability distribution function. The simplest choice is a multivariate 
Gaussian distribution compatible with the above two-time correlation. In discrete time, 
we write: 

$$P(\vec \xi) = \frac{1}{\mathcal N} \exp\left( -\frac{1}{4A} \sum_{j=0}^{\mathcal M} \sum_{i=1}^{d} \xi_i^2(t_j) \right) , \qquad (6.8)$$

with $\mathcal N$ the normalization constant. The Gaussian hypothesis is based on the central limit
theorem. Indeed, if one observes the thermal force acting on a sufficiently large particle,
with a time-grid that is much larger than $\tau_c$, $\vec \xi$ is the result of a large number of random
forces with finite average and variance, all sharing the same distribution law. However,
one could imagine 'sporadic' thermal baths that would lead to deviations from a Gaussian
distribution of random forces, in particular in the tails.

The Gaussian hypothesis implies that all higher-order correlations can be expressed as
functions of the two-time correlation (6.6). This is the content of Wick's theorem, which
states that:

$$\langle \xi_{i_{2n}}(t_{2n}) \cdots \xi_{i_1}(t_1) \rangle = \sum_{\rm pairs} \langle \xi_{i_{2n}}(t_{2n}) \xi_{i_{2n-1}}(t_{2n-1}) \rangle \cdots \langle \xi_{i_2}(t_2) \xi_{i_1}(t_1) \rangle ,$$

with the sum running over all the different ways of separating the product of 2n variables
into n pairs. The average of a product of an odd number of noise factors vanishes identically
since the thermal noise has zero average (remember that the average of the random
force, namely the friction force, has been subtracted off). So far the variance, 2A, is a free
parameter. We shall see later on that if the environment is assumed to be in equilibrium
at a temperature T, then A must in fact be related to $\gamma$ and T.
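
The sum over pairings in Wick's theorem can be written out explicitly for a finite set of zero-mean Gaussian variables. The following is only an illustrative sketch: the helper names `pairings` and `wick_moment`, and the representation of the two-point function as a matrix `cov[i][j]`, are assumptions introduced here.

```python
def pairings(indices):
    """Yield every way of splitting the list `indices` into unordered pairs."""
    if not indices:
        yield []
        return
    first, rest = indices[0], indices[1:]
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def wick_moment(indices, cov):
    """Moment <x_{i1} ... x_{i2n}> of zero-mean Gaussian variables, computed as the sum
    over pairings of products of two-point functions cov[i][j]; odd moments vanish."""
    if len(indices) % 2 == 1:
        return 0.0
    total = 0.0
    for pairing in pairings(list(indices)):
        term = 1.0
        for i, j in pairing:
            term *= cov[i][j]
        total += term
    return total


if __name__ == "__main__":
    cov = [[2.0, 0.5], [0.5, 1.0]]  # assumed two-point function of two variables
    print("<x0^4>      =", wick_moment([0, 0, 0, 0], cov))  # 3 pairings, each cov[0][0]^2
    print("<x0^2 x1^2> =", wick_moment([0, 0, 1, 1], cov))
    print("number of pairings of 6 variables:", len(list(pairings(list(range(6))))))
```

For 2n variables the number of pairings is (2n-1)!!, which is why the fourth moment of a single Gaussian variable equals three times the square of its variance.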

Irreversibility and dissipation. 
The friction force $-\gamma \vec v$ in the Langevin equation (6.5) explicitly breaks the time-reversal
($t \to -t$) invariance, a property that has to be respected by any set of microscopic dynamic
equations. This is the well known paradox raised by the irreversibility of thermodynamics.
However, the Langevin equation is an effective equation that only describes the particle
and not the individual motion of the molecules of the surrounding fluid. Of course,
Newton's equation describing the whole system, the particle and all the molecules of the
fluid, must be time-reversal invariant. However, time-reversal can be broken in the reduced
equation, where the ability of the thermal bath to reach equilibrium is assumed from the
start. The Langevin approach is a clever way to hide the irreversibility problem under the
rug (by transferring the conceptual difficulties to the reservoir) and allows one to investigate
in a phenomenological way the dynamics of the particle alone. In Sect. ?? we shall study
a simple model where the assumptions behind Langevin's equation can be made more
transparent.

Note that the energy of the particle is not conserved and, in general, flows to the bath 
leading to dissipation. At very long times however, the particle may reach a stationary 
regime in which the exchange of energy becomes symmetric on average: the particle gives 
and receives energy from the bath at equal rate. 

Generation of memory. 

The Langevin equation (6.5) is a first-order differential equation on the velocity. The
full dynamics of the particle is determined by this equation together with $\vec v(t) = \dot{\vec x}(t)$
which is also a first-order differential equation.

These features imply that the pair velocity-position of the particle at time $t + \delta$, with
$\delta$ an infinitesimal time-step, depends on the pair velocity-position at time t and the value
of the noise at time t. Thus, the full set of equations defines a Markov process, that is
a stochastic process that depends on its history only through its very last step. Notice,
however, that the pair of first-order differential equations could also be described by a
single second-order differential equation:

$$m \ddot{\vec x}(t) + \gamma \dot{\vec x}(t) = \vec F(\vec x, t) + \vec \xi(t) . \qquad (6.9)$$

Having replaced the velocity by its definition in terms of the position the Markov character 
of the process is lost. This is a very general feature: by integrating away some degrees of 
freedom one generates memory in the evolution. Generalizations of the Langevin equation, 
such as the one that we present in Sect. 6.1, and the ones that will be generated to describe 
the slow evolution of super-cooled liquids and glasses, do have memory. 

Fluctuation - dissipation relation of the second kind. 

In the heuristic derivation of the Langevin equation that we presented above the constant
A is not fixed. The simplest way of fixing the value of this parameter is to study
the velocity fluctuations of a Brownian particle in a constant external force. The
time-dependent velocity follows from the integration over time of eqn (6.5)

$$\vec v(t) = \vec v_0 \, e^{-\frac{\gamma}{m} t} + \frac{1}{m} \int_0^t dt' \, e^{-\frac{\gamma}{m}(t - t')} \left[ \vec F + \vec \xi(t') \right] ,$$

with $\vec v_0$ the initial velocity at t = 0. Using the fact that the noise has zero average one
finds

$$\langle \vec v(t) \rangle = \vec v_0 \, e^{-\frac{\gamma}{m} t} + \frac{\vec F}{\gamma} \left( 1 - e^{-\frac{\gamma}{m} t} \right) \to \frac{\vec F}{\gamma} \quad \text{when} \quad t \gg t_c^v \equiv \frac{m}{\gamma} .$$

Using the noise-noise auto-correlation in eqn (6.7), and setting $\vec v_0 = \vec 0$ for simplicity, one
readily calculates the mean-square displacement of the velocity in each direction of space,
$\sigma_v^2(t) \equiv \langle (v_i(t) - \langle v_i(t) \rangle)^2 \rangle$:

$$\sigma_v^2(t) = \frac{1}{m^2} \int_0^t dt' \int_0^t dt'' \, e^{-\frac{\gamma}{m}(2t - t' - t'')} \langle \xi_i(t') \xi_i(t'') \rangle = \frac{A}{\gamma m} \left( 1 - e^{-2 \frac{\gamma}{m} t} \right) .$$

Since the Langevin equation is a phenomenological description of the approach to thermal
equilibrium, we must impose for consistency that the above quantity saturates to the
expected value calculated with the canonical distribution at temperature T. Thus,

$$\lim_{t \gg t_c^v} \sigma_v^2(t) = \frac{A}{\gamma m} = \langle v_i^2 \rangle_{\rm eq} = \frac{T}{m} ,$$

where $\langle \dots \rangle_{\rm eq}$ denotes an average taken with Maxwell's velocity distribution. For this
equality to hold one enforces that:

$$A = \gamma T . \qquad (6.10)$$

This relation is known under the name of fluctuation-dissipation theorem (FDT) of the
'second kind' in Kubo's nomenclature. (The 'first kind' will be discussed below; these
names are here a little unfortunate.)

It is important to note that this FDT characterizes the surrounding fluid and not the
particle, since it relates the noise-noise correlation to the friction coefficient. In the case
of the Brownian particle this relation ensures that after a transient of the order of $t_c^v$,
the bath maintains the mean kinetic energy of the particle constant and equal to its
equilibrium value. The Gaussian distribution of the noise and the linear relation linking
it to the velocity imply that the velocity of the particle is indeed distributed according
to Maxwell's distribution. We shall see later that even when the environment satisfies a
fluctuation-dissipation relation (FDR) the system in contact with it does not necessarily
follow and satisfy an FDR itself. This is one of the main characteristics of non-equilibrium
systems in contact with equilibrated environments.
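
The role of the relation $A = \gamma T$ can be illustrated with a direct simulation of the Langevin equation (6.5) without external force. The sketch below makes several assumptions for the sake of illustration: a simple Euler discretization with time step `dt`, one spatial dimension, units with $k_B = 1$, arbitrary parameter values, and hypothetical function names. It is not the only (nor the most accurate) way to integrate the equation.

```python
import math
import random


def simulate_velocity(n_particles=4000, n_steps=250, dt=0.02,
                      m=1.0, gamma=1.0, T=2.0, seed=0):
    """Euler integration of m dv/dt = -gamma v + xi(t) in one dimension, with
    <xi(t) xi(t')> = 2 A delta(t - t') and A = gamma T (k_B = 1).
    All particles start at rest; returns their velocities at time n_steps * dt."""
    rng = random.Random(seed)
    kick = math.sqrt(2.0 * gamma * T * dt) / m  # std of the velocity kick per step
    damping = 1.0 - gamma * dt / m              # deterministic friction factor per step
    velocities = []
    for _particle in range(n_particles):
        v = 0.0
        for _step in range(n_steps):
            v = damping * v + kick * rng.gauss(0.0, 1.0)
        velocities.append(v)
    return velocities


def mean_square(values):
    """Ensemble average of the square of the sampled values."""
    return sum(x * x for x in values) / len(values)


if __name__ == "__main__":
    v = simulate_velocity()
    print("measured <v^2>:", mean_square(v), "  expected T/m:", 2.0)
```

After a time of a few $m/\gamma$ the measured $\langle v^2 \rangle$ fluctuates around $T/m$, up to statistical errors and a small bias of order `gamma * dt / m` coming from the finite time step.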

Diffusion in velocity space. 

For the sake of simplicity let us focus in this section on a one dimensional problem.
The two-time velocity-velocity 'variogram', defined as
$\Delta_{vv}(t, t') \equiv \langle (v(t) - v(t'))^2 \rangle = C_{vv}(t, t) + C_{vv}(t', t') - 2 C_{vv}(t, t')$,
is a simple function of the two-time correlation of the
velocity $C_{vv}(t, t') = \langle v(t) v(t') \rangle$ that itself is easily calculated to be:

$$C_{vv}(t, t') = v_0^2 \, e^{-\frac{\gamma}{m}(t + t')} + \frac{T}{m} \left( e^{-\frac{\gamma}{m}|t - t'|} - e^{-\frac{\gamma}{m}(t + t')} \right) . \qquad (6.11)$$

The first term comes from the initial condition and the second term sometimes goes under
the name of Dirichlet correlator. When $t + t' \gg t_c^v$ the initial condition is forgotten, the
last term vanishes, and the correlation and displacement become functions of $|t - t'|$ only,
as expected in equilibrium (see Sect. 6.3.1). The variogram crosses over from a diffusive
regime to saturation at the characteristic time $t_c^v$:

$$\Delta_{vv}(t, t') \approx \begin{cases} 2 D_v |t - t'| & \text{when } |t - t'| \ll t_c^v , \text{ with } D_v = \gamma T / m^2 , \\ 2 T / m & \text{when } |t - t'| \gg t_c^v . \end{cases}$$

The coefficient $D_v$ of the linear regime is called the velocity diffusion coefficient.

Diffusion in position space. 

The position-position variogram is similarly defined as $\Delta_{xx}(t, t') \equiv \langle (x(t) - x(t'))^2 \rangle$
and measures the square of the typical displacement of the particle between t and t'.
Choosing $x_0 = v_0 = 0$ one finds:

$$\Delta_{xx}(t, t') = \frac{2A}{\gamma^2} \left[ |t - t'| + \frac{2m}{\gamma} \left( e^{-\frac{\gamma}{m}|t - t'|} - 1 \right) - \frac{m}{2\gamma} \left( e^{-2\frac{\gamma}{m}|t - t'|} - 1 \right) \right]$$

and there is also a change in behavior at time-differences of the order of $t_c^v$:

$$\Delta_{xx}(t, t') \approx \begin{cases} \frac{2 D_v}{3} |t - t'|^3 & \text{when } |t - t'| \ll t_c^v , \\ 2 D_x |t - t'| & \text{when } |t - t'| \gg t_c^v , \text{ with } D_x = T / \gamma . \end{cases}$$

At small times, the particle is subject to a random acceleration; its velocity is thus of the
order of $\sqrt{|t - t'|}$, leading to a typical displacement of the order of $|t - t'|^{3/2}$. For times
larger than $t_c^v$, the velocity saturates under the effect of friction, and normal diffusion sets
in, with a typical displacement given by $\sqrt{D_x |t - t'|}$.
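
The crossover between the two regimes of the position variogram can be checked by evaluating the full expression numerically. This is a minimal sketch under stated assumptions: a particle starting at rest at the origin with no external force, $A = \gamma T$, units with $k_B = 1$, and a hypothetical function name `position_variogram` whose argument `tau` stands for the elapsed time.

```python
import math


def position_variogram(tau, m=1.0, gamma=1.0, T=1.0):
    """Mean-square displacement after an elapsed time tau for x0 = v0 = 0:
    (2A/gamma^2) [tau + (2m/gamma)(e^{-gamma tau/m} - 1) - (m/2gamma)(e^{-2 gamma tau/m} - 1)]."""
    A = gamma * T
    r = gamma / m  # inverse of the velocity relaxation time t_c^v
    # math.expm1(x) = exp(x) - 1, accurate also for small |x|
    bracket = (tau
               + (2.0 / r) * math.expm1(-r * tau)
               - (0.5 / r) * math.expm1(-2.0 * r * tau))
    return (2.0 * A / gamma ** 2) * bracket


if __name__ == "__main__":
    D_v = 1.0  # gamma T / m^2 for the default parameters
    D_x = 1.0  # T / gamma for the default parameters
    for tau in (1e-2, 1e-1, 1.0, 1e1, 1e3):
        print(f"tau={tau:<7g} exact={position_variogram(tau):.4e} "
              f"short-time={(2.0 * D_v / 3.0) * tau ** 3:.4e} "
              f"long-time={2.0 * D_x * tau:.4e}")
```

For elapsed times much shorter than $m/\gamma$ the exact expression follows the cubic law, and for much longer times it approaches the linear diffusive law.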

Einstein relation. 

The mobility of the particle is defined as

$$\mu \equiv \lim_{F \to 0} \lim_{t \gg t_c^v} \frac{\langle v(t) \rangle}{F} . \qquad (6.12)$$

Using the above result on $D_x$, we find:

$$\mu = \frac{1}{\gamma} = \frac{D_x}{T} \quad \text{when} \quad |t - t'| \gg t_c^v . \qquad (6.13)$$

This identity between $\mu$ and $D_x$ is known as the Einstein relation between the transport
properties of the particle, its diffusion in real space, and the temperature of the
surroundings. It expresses the fluctuation-dissipation theorem of the first kind, to be distinguished
from the one of the second kind by the fact that it now describes a dynamic property of
the particle induced by those of the bath.

Stokes-Einstein relation.

We have mentioned that in a normal fluid the viscosity and friction coefficient of the
frictional force exerted on a spherical particle of radius a are related by the Stokes law
$\gamma = 6 \pi a \eta$. This relation, combined with (6.13), implies the Stokes-Einstein relation

$$6 \pi a \eta = \frac{T}{D_x}$$

linking the viscosity to the temperature and spatial diffusion constant. This prediction,
first obtained by Einstein and then confirmed by Langevin, was experimentally verified
by Perrin. Much more recently, though, it has been noticed that in liquids that are
super-cooled this relation ceases to be valid.
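
To get a feeling for the orders of magnitude involved, the Stokes-Einstein relation can be evaluated for a colloidal particle in water. The values below are assumptions chosen for illustration (a radius of one micrometre, room temperature of 300 K, and a viscosity of 0.01 Poise, that is $10^{-3}$ Pa s); the Boltzmann constant, set to one in the text, is restored to work in SI units, and the function name is hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (SI defining value)


def stokes_einstein_diffusion(radius_m, viscosity_pa_s, temperature_k):
    """Spatial diffusion constant D_x = k_B T / (6 pi a eta), in m^2/s."""
    return K_B * temperature_k / (6.0 * math.pi * radius_m * viscosity_pa_s)


if __name__ == "__main__":
    # Assumed illustrative values: a = 1e-6 m, eta = 1e-3 Pa s, T = 300 K.
    D = stokes_einstein_diffusion(1e-6, 1e-3, 300.0)
    print(f"D_x = {D:.3e} m^2/s")
    print(f"typical displacement in one second = {math.sqrt(2.0 * D):.3e} m")
```

With these values the diffusion constant is a fraction of a square micrometre per second, so that the particle typically moves by less than its own diameter in one second.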

Smoluchowski (strong overdamped) limit. 

In many situations in which friction is very strong the inertial term $m \dot{\vec v}$ can be dropped
from eqn (6.5):

$$\gamma \dot{\vec x}(t) = \vec F(\vec x, t) + \vec \xi(t) . \qquad (6.14)$$

This limit is acceptable when the observation times are much longer than the characteristic
time for relaxation $t_c^v = m / \gamma$, such that the mean-squared displacement of the velocity
saturates and the position diffuses. The range of validity of this approximation can be
easily put to test in the example of a particle moving in a harmonic potential.

In the white-noise case the friction coefficient can be eliminated from the Langevin
equation in the Smoluchowski limit, and from the noise-noise correlation, with the rescaling of
time: $\tau = \gamma^{-1} t$, $\tilde x(\tau) = x(\gamma \tau)$.

6.1.2 Generalized Langevin equations 

Langevin-like equations are used to describe the dynamics of much more general
microscopic or macroscopic systems coupled to environments. The applications in physics,
chemistry and engineering are numerous. One can cite, for instance, the description of the 
dynamics of macromolecules in solution, with the analysis of the electrophoresis technique 
as a particular case. The time required to dissociate molecules or the transition rate be- 
tween molecular configurations are subjects of great interest in chemistry that have been 
attacked with the Langevin approach. Stochastic equations with damping and white noise 
are also used to describe noisy electric circuits. To treat glassy problems that are typ- 
ically macroscopic systems constituted by particles (colloids, atoms, molecules, spins...) 
in contact with an environment (the solution, phonons...) that is described statistically 
we need to justify them beyond the simple Brownian motion problem. 

Langevin equations with multiplicative noise. 

So far the random force appeared as a separate term in the Langevin equation. Extensions
of the Langevin approach with multiplicative noise lead to the so-called non-linear
Langevin equations [?] in which the usually delta-correlated Gaussian noise $\xi$ multiplies a
certain function $e(x, t)$ of the stochastic variable x itself:



However, as written, this equation is at best ambiguous. A proper prescription in a dis- 
crete time setting must be specified. This is a very important point that we shall not 
discuss here. It leads to what is called Ito and Stratonovich calculus, basically differ- 
ent prescriptions as to how to interpret the continuous differential equation and how to 
discretize the time-derivative. 
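The ambiguity can be made concrete with a small numerical sketch. The example equation $\dot x = x\,\xi(t)$ with $\langle\xi(t)\xi(t')\rangle = 2D\,\delta(t-t')$ is an assumption chosen for illustration and is not taken from these notes. The same noise increments are fed to an Euler scheme, which realizes the Ito prescription, and to a Heun predictor-corrector scheme, which converges to the Stratonovich one. The averages differ: $\langle x(t)\rangle = x(0)$ for Ito and $\langle x(t)\rangle = x(0)\,e^{Dt}$ for Stratonovich.

```python
import math
import random

def mean_final_x(D=0.5, t_final=1.0, n_steps=50, n_paths=20_000, seed=2):
    """Integrate dx/dt = x*xi(t), <xi xi'> = 2*D*delta, from x(0) = 1, feeding the
    same noise increments to an Ito (Euler) and a Stratonovich (Heun) scheme."""
    rng = random.Random(seed)
    dt = t_final / n_steps
    amp = math.sqrt(2.0 * D * dt)
    sum_ito = sum_strat = 0.0
    for _ in range(n_paths):
        x_ito = x_strat = 1.0
        for _ in range(n_steps):
            dw = amp * rng.gauss(0.0, 1.0)
            # Ito: the prefactor x is evaluated before the noise increment acts.
            x_ito += x_ito * dw
            # Stratonovich (Heun): average the prefactor over start and predicted end.
            x_pred = x_strat + x_strat * dw
            x_strat += 0.5 * (x_strat + x_pred) * dw
        sum_ito += x_ito
        sum_strat += x_strat
    return sum_ito / n_paths, sum_strat / n_paths

mean_ito, mean_strat = mean_final_x()
print(mean_ito, mean_strat, math.exp(0.5))
```

With the default (hypothetical) parameters the two averages should be close to 1 and to $e^{1/2}$ respectively, up to sampling noise.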

Langevin equations with memory.

A derivation of a generalized linear Langevin equation with memory is very simple. In
general, one studies the coupled (closed) system made by the actual system of interest 
in interaction with an environment. The description of the bath and of its interaction 
with the system depends on the problem at hand. The simplest choice is that of an 
independent ensemble of harmonic oscillators that couple linearly to each coordinate-like 
degree of freedom in the system. This choice allows one to solve the dynamic equations for 
the bath variables analytically. After introducing their solution in the dynamic equations 
for the system, one thus obtains the dynamics of the reduced system. Until this point the 
dynamics of the system remains deterministic and is completely determined by its initial 
conditions as well as those of the reservoir variables. The statistical element comes in when 
one proposes that the initial coordinates and momenta of the oscillators in the bath are 
distributed according to an equilibrium measure. This induces randomness and friction 
in the dynamics of the reduced system. After performing explicitly these calculations one 
ends up with the generalized Langevin equation: 

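\[ m\ddot x_i(t) = F_i(x,t) + \xi_i(t) - \int_0^t dt'\,\gamma(t-t')\,\dot x_i(t') , \tag{6.16} \]

with the Gaussian thermal noise characterized by

\[ \langle \xi_i(t) \rangle = 0 \quad \forall i \ \mbox{and} \ \forall t , \tag{6.17} \]
\[ \langle \xi_i(t)\,\xi_j(t') \rangle = T\,\delta_{ij}\,\gamma(t-t') , \tag{6.18} \]

and $\gamma(t-t')$ a retarded friction. A multiplicative retarded noise arises from a model in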
which one couples the coordinates of the oscillators to a generic function of the coordinates 
of the system. 

Different oscillator reservoirs are characterized by different kernels $\gamma(t-t')$. The
spectral function of the colored bath is usually assumed to have the form

\[ S(\omega) = T\,\tilde\gamma(\omega) = 2T\gamma \left(\frac{|\omega|}{\tilde\omega}\right)^{\delta-1} f_c\!\left(\frac{|\omega|}{\Lambda}\right) . \tag{6.19} \]

The function $f_c(x)$ is a high-frequency cut-off of typical width $\Lambda$ and is usually chosen
to be an exponential. The frequency $\tilde\omega \ll \Lambda$ is a reference frequency that allows one to
have a coupling strength $\gamma$ with the dimensions of a viscosity. If $\delta = 1$, the friction is
said to be Ohmic, $S(\omega)$ is constant when $|\omega| \ll \Lambda$ and one recovers a white noise. When
$\delta > 1$ ($\delta < 1$) the bath is superOhmic (subOhmic). The exponent $\delta$ is taken to vary in
the interval [0, 2] to avoid divergences.

Time-dependent, $f(t)$, and constant non-potential forces, $f^{np}$, as the ones applied to
granular matter and in rheological measurements, respectively, are simply included in the
right-hand-side (rhs) as part of the deterministic force. When the force derives from a
potential, $F_i(t) = -\partial_{x_i} V(x(t))$.

So far we have discussed systems with position and momentum degrees of freedom.
Other variables might be of interest to describe the dynamics of different kinds of systems.
In particular, a continuous Langevin equation for classical spins can also be used if one
replaces the hard Ising constraint, $s_i = \pm 1$, by a soft one implemented with a potential
term of the form $V(s_i) = u\,(s_i^2 - 1)^2$ with $u$ a coupling strength (that one eventually
takes to infinity to recover a hard constraint). The soft spins are continuous unbounded
variables, $s_i \in (-\infty, \infty)$, but the potential energy favors the configurations with $s_i$ close
to $\pm 1$. Even simpler models are constructed with spherical spins, that are also continuous
unbounded variables globally constrained to satisfy $\sum_{i=1}^N s_i^2 = N$.

6.2 Averages, correlations and responses 

In Sect. 6.1 we analyzed several averaged properties of Brownian motion. In this Section 
we introduce the averaged observables on which we shall focus in the rest of the book. 
We use a vector x to encode all dynamic degrees of freedom in the system. These can be 
position and momenta for a particle system, spins for a magnetic sample, or even a field 
such as the local and time-dependent density. 

6.2.1 One-time functions 

The simplest dynamic observables depend only on one time and are defined by

\[ \langle A(x,t) \rangle = \int \mathcal{D}\xi\, P(\xi)\, A(x_\xi, t) , \tag{6.20} \]

where $\xi$ symbolically denotes the noise at all times and $x_\xi$ the trajectory generated by
that noise history. The integral indicates a sum over all possible noise history realizations,
each counted with its weight $P(\xi)$. This is a path integral and we shall not discuss it here.
Even if $A$ can in principle be any complicated functional of the system's degrees of freedom,
in most cases of interest it is a simple scalar function of these variables. Typical examples
we shall examine later are the energy density, the magnetization density in a magnetic system,
the particle density in a liquid or a glass, etc.

6.2.2 The correlation functions 

Given any two functionals of $x$, say $A(x,t)$ and $B(x,t)$, one defines the correlation function
between $A$ and $B$ at two subsequent times $t$ and $t'$ as

\[ C_{AB}(t,t') \equiv \langle A(x,t)\, B(x,t') \rangle = \int \mathcal{D}\xi\, P(\xi)\, A(x_\xi,t)\, B(x_\xi,t') . \]

Note that the auto-correlations are, by definition, symmetric under exchanges of $t$ and $t'$,

\[ C_{AA}(t,t') = C_{AA}(t',t) . \]

In a generic situation $C_{AB}(t,t')$ is a function of both times $t$ and $t'$. We shall see
that when the system reaches a steady state, and in particular the equilibrium measure,
$C_{AB}(t,t')$ becomes stationary and a function of time-differences only,
$C_{AB}(t,t') = C^{st}_{AB}(t-t') = C^{st}_{AB}(\tau)$ with $\tau = t - t'$. In complete generality we
write the two-time correlator as a function of the time difference and the earlier time,
$C_{AB}(t'+\tau, t')$, and we define Fourier transforms with respect to the time difference

\[ \tilde C_{AB}(\omega,t') = \int_{-\infty}^{\infty} d\tau\, e^{i\omega\tau}\, C_{AB}(t'+\tau,t') , \]

with the inverse Fourier transform given by

\[ C_{AB}(t'+\tau,t') = \int_{-\infty}^{\infty} \frac{d\omega}{2\pi}\, e^{-i\omega\tau}\, \tilde C_{AB}(\omega,t') . \]

Figure 8: Sketch of perturbations of strength h; a kick (left) and a step (right).

In the stationary limit these become the usual expressions

\[ \tilde C_{AB}(\omega) = \int_{-\infty}^{\infty} d\tau\, e^{i\omega\tau}\, C^{st}_{AB}(\tau) , \qquad C^{st}_{AB}(\tau) = \int_{-\infty}^{\infty} \frac{d\omega}{2\pi}\, \tilde C_{AB}(\omega)\, e^{-i\omega\tau} , \]

and $\tilde C_{AA}(\omega)$ is the spectral density of $A$.

Sometimes it is useful to calculate the connected correlation functions in which one
correlates the fluctuations of $A$ and $B$ about their averages:

\[ C^c_{AB}(t,t') \equiv \langle\, (A(x,t) - \langle A(x,t)\rangle)\,(B(x,t') - \langle B(x,t')\rangle)\, \rangle . \]

Other important two-time quantities in the analysis of dynamical problems with diffusion
are the displacement between observables:

\[ \Delta_{AB}(t,t') \equiv \langle\, (A(x,t) - B(x,t'))^2 \,\rangle = C_{AA}(t,t) + C_{BB}(t',t') - 2\,C_{AB}(t,t') , \]

and the displacement between the fluctuations of these observables,

\[ \Delta^c_{AB}(t,t') \equiv \langle\, (A(x,t) - \langle A(x,t)\rangle - B(x,t') + \langle B(x,t')\rangle)^2 \,\rangle = C^c_{AA}(t,t) + C^c_{BB}(t',t') - 2\,C^c_{AB}(t,t') . \]
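As an illustration of how such averages over noise histories are estimated in practice, the following sketch takes the simplest example, an unbiased discrete-time random walk with unit-variance Gaussian steps (an assumption made here, with $A = B = x$). For this process $C(t,t') = \min(t,t')$ and the displacement equals $|t-t'|$. The displacement computed directly is compared with the combination $C(t,t) + C(t',t') - 2C(t,t')$.

```python
import random

def ensemble(n_walkers=5000, n_steps=40, seed=3):
    """Each walker is one noise history: x(t) is the sum of t unit-variance kicks."""
    rng = random.Random(seed)
    paths = []
    for _ in range(n_walkers):
        x, path = 0.0, [0.0]
        for _ in range(n_steps):
            x += rng.gauss(0.0, 1.0)
            path.append(x)
        paths.append(path)
    return paths

def correlation(paths, t, tp):
    """C(t, t') = <x(t) x(t')>, the average running over noise histories."""
    return sum(p[t] * p[tp] for p in paths) / len(paths)

def displacement(paths, t, tp):
    """Delta(t, t') = <(x(t) - x(t'))^2>."""
    return sum((p[t] - p[tp]) ** 2 for p in paths) / len(paths)

paths = ensemble()
t, tp = 40, 10
c_cross = correlation(paths, t, tp)
delta = displacement(paths, t, tp)
delta_from_c = correlation(paths, t, t) + correlation(paths, tp, tp) - 2.0 * c_cross
print(c_cross, delta, delta_from_c)
```

The last two numbers agree up to rounding, since the relation is an algebraic identity for any sample; the comparison with $\min(t,t')$ and $|t-t'|$ holds only within sampling error.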

6.2.3 The linear response 

The application of an infinitesimal external force of strength h, possibly time-dependent, 
that couples linearly to a generic function B of the system's degrees of freedom, modifies 
the Hamiltonian according to 

H -> H - h(t)B(x,t) . 

We represent the instantaneous infinitesimal perturbation $h(t)$ as the kick between $t_2 - \delta/2$
and $t_2 + \delta/2$ in Fig. 8-left and a step-like perturbation that is continuously applied after
time $t_2$ in the right panel of the same figure.

The variation of the average of another generic function $A$ evaluated at time $t$, due to
the applied force is

\[ \delta\langle A(x,t) \rangle = \int_0^\infty dt'\, R_{AB}(t,t')\, \delta h(t') . \tag{6.21} \]

This relation defines the instantaneous linear response $R_{AB}(t,t')$.

It is important to notice that the perturbation can modify the particle's position only
at future times. This is the reason why the integration in (6.21) can only span the time
interval going from the initial time $t' = 0$ to the measuring time $t' = t$. Otherwise stated,
the instantaneous linear response is causal and hence proportional to the Heaviside theta
function:

\[ R_{AB}(t,t') \equiv \left. \frac{\delta\langle A(x,t)\rangle}{\delta h(t')} \right|_{h=0} = r_{AB}(t,t')\,\theta(\tau) , \qquad \tau = t - t' . \tag{6.22} \]

Equation (6.21) then means that the change in the averaged function A due to the coupling 
of B to the force h exerted during a finite time-interval is a linear superposition of the 
changes due to spike-like perturbations applied at each instant in the same time-interval. 

A very useful expression for the linear response of a Langevin process with additive
noise is a consequence of Novikov's formula, which applies to general functions of Gaussian
random variables with zero mean and states that

\[ \langle x(t)\,\xi(t') \rangle = \int dt''\, \langle \xi(t')\,\xi(t'') \rangle \left\langle \frac{\delta x(t)}{\delta \xi(t'')} \right\rangle . \tag{6.23} \]

Using the noise-noise correlation (6.18) and the fact that the second factor in the integrand
equals the linear response one finds

\[ \langle x(t)\,\xi(t') \rangle = T \int_0^\infty dt''\, \gamma(t',t'')\, R(t,t'') . \tag{6.24} \]

For a white noise $\gamma(t',t'') = 2\gamma\,\delta(t'-t'')$ and the relation becomes
$2\gamma T\, R(t,t') = \langle x(t)\,\xi(t') \rangle$.
The Ito convention assumes that the noise term $\xi(t)$ has a strictly zero correlation time
and is always slightly posterior to (and therefore independent of) any function $G(x,t)$,
possibly determined by all previous values of $\xi(t')$, $t' < t$. Therefore, $\langle \xi(t)\,x(t) \rangle$ vanishes.
The Stratonovich rule, on the other hand, assumes that the noise term $\xi(t)$ has a very
small, but non zero correlation time, which is taken to zero after the continuous time limit
of the Langevin equation has been taken. This implies that the linear responses take the
boundary values:

Stratonovich: $R(t,t) = 1/(2\gamma)$ , $R(t,t^-) = 1/\gamma$ ,
Ito: $R(t,t) = 0$ , $R(t,t^-) = 1/\gamma$ .

A function that we shall explore in great detail in future chapters is the integrated
linear response

\[ \chi_{AB}(t,t_w) \equiv \int_{t_w}^{t} dt''\, R_{AB}(t,t'') . \tag{6.25} \]

It represents the influence on the observable $A$ of a perturbation applied at the waiting-
time $t_w$ and held constant until the measuring time $t$, as the one sketched in Fig. 8-right,
normalized by the strength of the applied field.
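A short sketch can make the two readings of the integrated response explicit. It assumes, for illustration only, the overdamped harmonic process $\gamma\dot x = -kx + h(t) + \xi(t)$, for which $R(t,t') = \gamma^{-1} e^{-k(t-t')/\gamma}\,\theta(t-t')$ and $\chi(t,t_w) = (1 - e^{-k(t-t_w)/\gamma})/k$. The first function integrates the response numerically; the second applies a small step field at $t_w$ and reads off $\langle x(t)\rangle/h$ (for a linear force the noise drops out of the average).

```python
import math

def response(t, tp, k=1.0, gamma=1.0):
    """Assumed example: causal response of gamma*dx/dt = -k*x + h(t) + xi(t)."""
    return math.exp(-k * (t - tp) / gamma) / gamma if t >= tp else 0.0

def integrated_response(t, tw, n=2000, **kw):
    """chi(t, tw) = int_{tw}^{t} dt'' R(t, t''), trapezoidal rule."""
    h = (t - tw) / n
    total = 0.5 * (response(t, tw, **kw) + response(t, t, **kw))
    total += sum(response(t, tw + i * h, **kw) for i in range(1, n))
    return total * h

def step_field_average(t, tw, field=1e-3, k=1.0, gamma=1.0, n=20_000):
    """<x(t)>/h for a step field switched on at tw (noise-averaged, linear force)."""
    dt = (t - tw) / n
    x = 0.0
    for _ in range(n):
        x += (-k * x + field) * dt / gamma
    return x / field

chi_num = integrated_response(3.0, 1.0)
chi_step = step_field_average(3.0, 1.0)
chi_exact = 1.0 - math.exp(-2.0)
print(chi_num, chi_step, chi_exact)
```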

The Fourier transform with respect to the time-difference $\tau$ of the response function
at fixed time $t'$ reads

\[ \tilde\chi_{AB}(\omega,t') \equiv \int_{-\infty}^{\infty} d\tau\, r_{AB}(t'+\tau,t')\,\theta(\tau)\, e^{i\omega\tau} , \tag{6.26} \]

and defines a generalized ac-susceptibility. The real and imaginary parts of $\tilde\chi_{AB}$ are
called $\chi'_{AB}$ and $\chi''_{AB}$, respectively. The generalized ac-susceptibility satisfies
$\tilde\chi_{AB}(\omega,t') = \tilde\chi^*_{AB}(-\omega,t')$ and $\chi'_{AB}$ and $\chi''_{AB}$ are, respectively, even and odd
functions of $\omega$:

\[ \chi'_{AB}(\omega,t') = \chi'_{AB}(-\omega,t') , \qquad \chi''_{AB}(\omega,t') = -\chi''_{AB}(-\omega,t') . \]

In equilibrium $\tilde\chi_{AB}(\omega,t') \to \tilde\chi_{AB}(\omega)$ since $\chi_{AB}(t,t') \to \chi_{AB}(\tau)$ for all observables $A$ and
$B$.

6.2.4 Multi correlations and responses

More general correlations and linear responses involve many times and possibly many-point
functions. More precisely, one defines

\[ C_{A_n A_{n-1} \dots A_1}(t_n, t_{n-1}, \dots, t_1) \equiv \langle A_n(x,t_n) \dots A_1(x,t_1) \rangle , \tag{6.27} \]

and the reaction of many observables to several fields linearly coupled to the system at
different times; i.e. if $H \to H - h_k(t_k) A_k(t_k)$ or
$H \to H - h_k(t_k) A_k(t_k) - h_{k-1}(t_{k-1}) A_{k-1}(t_{k-1})$,

\[ R_{A_n \dots A_1; A_k}(t_n, \dots, t_1; t_k) \equiv \left. \frac{\delta \langle A_n(t_n) \dots A_1(t_1) \rangle}{\delta h_k(t_k)} \right|_{h=0} , \tag{6.28} \]
\[ R_{A_n \dots A_1; A_k, A_{k-1}}(t_n, \dots, t_1; t_k, t_{k-1}) \equiv \left. \frac{\delta^2 \langle A_n(t_n) \dots A_1(t_1) \rangle}{\delta h_k(t_k)\, \delta h_{k-1}(t_{k-1})} \right|_{h=0} . \tag{6.29} \]

6.2.5 Absorption and dissipation

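The power absorbed by a system is just the work done by a force applied on it per
unit of time. More precisely, the differential work done on the system by a force $\vec F$
that is coupled to an observable $\vec O$ when the latter changes by an amount $d\vec O$ is

\[ dW = \vec F \cdot d\vec O , \]

and the absorbed power at time $t$ is

\[ P(t) = \frac{dW}{dt} = \vec F(t) \cdot \frac{d\langle \vec O \rangle}{dt} = F_i(t)\, \frac{\partial}{\partial t} \int_0^t dt'\, R_{O_i O_j}(t,t')\, F_j(t') . \]

In Fourier representation, this formula becomes

\[ P(t) = -i \int_{-\infty}^{\infty} \frac{d\omega}{2\pi} \int_{-\infty}^{\infty} \frac{d\omega'}{2\pi} \int_{-\infty}^{\infty} \frac{d\omega''}{2\pi}\; \omega\, \tilde F_i(\omega'')\, \tilde F_j(\omega')\, e^{-i(\omega+\omega'')t} \int_0^\infty dt'\, e^{i(\omega-\omega')t'}\, \tilde R_{ij}(\omega,t') , \]

where we have shortened the notation $R_{ij} \equiv R_{O_i O_j}$ and used the fact that, by causality,
the integral over $t'$ can be extended beyond $t$. In a steady state where $\tilde R_{ij}(\omega,t')$ does
not depend anymore on $t'$,

\[ P(t) = -i \left(\frac{1}{2\pi}\right)^2 \int_{-\infty}^{\infty} d\omega \int_{-\infty}^{\infty} d\omega''\; \omega\, \tilde F_i(\omega'')\, \tilde R_{ij}(\omega)\, \tilde F_j(\omega)\, e^{-i(\omega+\omega'')t} . \tag{6.30} \]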

Dissipation represents the transfer of energy from the system to the incoherent degrees 
of freedom that form its environment. The sign of the friction term in the Langevin equa- 
tion sets the direction of energy transfer from the system to the bath until equilibration 
sets in. Then, the averaged energy stabilizes and its fluctuations are only determined by 
the equilibrium measure. In this limit, using eqn (6.30) for the power and setting again
the "initial time" to $-\infty$, one finds that the total energy dissipated is

\[ \int_{-\infty}^{\infty} dt'\, P(t') = F_i F_j \int_{-\infty}^{\infty} \frac{d\omega}{2\pi}\; \omega\, \chi''_{ij}(\omega) , \]

when the applied force is a percussion, $F_i(t) = F_i\,\delta(t)$ for all $i$. An analogous expression,
again controlled by $\chi''_{ij}$ only, holds for an oscillatory force $F_i(t) = F_i \cos(\omega t)$ that is
applied during a finite time interval. In these two typical cases the total energy dissipated
depends only on the imaginary part of the Fourier transform of the response function,
$\chi''_{ij}$. This is the reason why $\chi''$ is usually called the dissipative part of the response.



6.2.6 The Kramers-Kronig relations. 

The Kramers-Kronig relations connect the real and imaginary parts of the frequency
dependent linear response $\tilde\chi_{AB}$. Usually they are derived in a stationary regime in which
the response is invariant under translations of time. More general relations connecting
the time dependent a.c. susceptibilities $\chi'(\omega,t')$ and $\chi''(\omega,t')$ can also be derived. The
proof that we present below generalizes the one in Ref. [?] to the non-stationary case.
The waiting time $t'$ in the definition (6.26) will in fact be a dummy variable that plays no
role in the derivation. The Fourier transform convolution theorem implies

\[ \tilde\chi_{AB}(\omega,t') = \int_{-\infty}^{\infty} \frac{d\omega'}{2\pi}\; \tilde r_{AB}(\omega-\omega',t')\, \tilde\theta(\omega') , \qquad \tilde r_{AB}(\omega-\omega',t') = \int_{-\infty}^{\infty} d\tau\, e^{i(\omega-\omega')\tau}\, r_{AB}(t'+\tau,t') , \]

and the Fourier transform of the Heaviside theta-function is

\[ \tilde\theta(\omega') = \int_{-\infty}^{\infty} d\tau\, e^{i\omega'\tau}\, \theta(\tau) = \lim_{\epsilon\to 0^+} \frac{i}{\omega' + i\epsilon} = i\,P\frac{1}{\omega'} + \pi\,\delta(\omega') , \]

with $P$ indicating the principal part of the integral. Hence,

\[ \tilde\chi_{AB}(\omega,t') = \frac{i}{2\pi}\, P\!\int_{-\infty}^{\infty} d\omega'\; \frac{\tilde r_{AB}(\omega',t')}{\omega-\omega'} + \frac{\tilde r_{AB}(\omega,t')}{2} . \tag{6.31} \]

Due to causality, the function $r_{AB}(\tau+t',t')$ is not determined for $\tau < 0$ (since it
gets multiplied by $\theta(\tau) = 0$). One can take advantage of this freedom and choose it
at convenience. Indeed, if one chooses $r_{AB}$ to be symmetric in $\tau$,
$r_{AB}(-\tau+t',t') = r_{AB}(\tau+t',t')$, then $\tilde r_{AB}(\omega',t')$ is real and eqn (6.31) implies
$\tilde r_{AB}(\omega',t') = 2\,{\rm Re}\,\tilde\chi_{AB}(\omega',t')$ that inserted back in eqn (6.31) yields

\[ \chi''_{AB}(\omega,t') = -\frac{1}{\pi}\, P\!\int_{-\infty}^{\infty} d\omega'\; \frac{\chi'_{AB}(\omega',t')}{\omega'-\omega} . \tag{6.32} \]

Instead, if one chooses $r_{AB}$ to be antisymmetric in $\tau$,
$r_{AB}(-\tau+t',t') = -r_{AB}(\tau+t',t')$, $\tilde r_{AB}(\omega',t')$ is imaginary and eqn (6.31) implies
$i\,\tilde r_{AB}(\omega',t') = -2\,{\rm Im}\,\tilde\chi_{AB}(\omega',t')$ that substituted in eqn (6.31) yields

\[ \chi'_{AB}(\omega,t') = \frac{1}{\pi}\, P\!\int_{-\infty}^{\infty} d\omega'\; \frac{\chi''_{AB}(\omega',t')}{\omega'-\omega} . \tag{6.33} \]

Equations (6.32) and (6.33) are the Kramers-Kronig relations, generalized to a non-
stationary situation. Clearly, if $R_{AB}$ reaches a stationary regime, the explicit $t'$ dependence
disappears and the usual relations are recovered.
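The stationary relations can be checked numerically on a simple case. The sketch below assumes the response of an overdamped harmonic oscillator, $r(\tau) = \gamma^{-1} e^{-k\tau/\gamma}$, whose susceptibility with the Fourier convention used here is $\tilde\chi(\omega) = 1/(k - i\gamma\omega)$, so that $\chi' = k/(k^2+\gamma^2\omega^2)$ and $\chi'' = \gamma\omega/(k^2+\gamma^2\omega^2)$. The principal value is handled with a grid placed symmetrically around the pole; grid size and cut-off are arbitrary illustrative choices.

```python
import math

K, GAMMA = 1.0, 1.0  # assumed spring constant and friction coefficient

def chi_real(w):
    return K / (K * K + (GAMMA * w) ** 2)

def chi_imag(w):
    return GAMMA * w / (K * K + (GAMMA * w) ** 2)

def principal_value(f, w, half_width=1000.0, h=0.02):
    """P int dw' f(w') / (w' - w) on the grid w' = w +/- (j + 1/2) h, which is
    symmetric about the pole so that its contributions cancel pairwise."""
    n = int(half_width / h)
    total = 0.0
    for j in range(n):
        u = (j + 0.5) * h
        total += (f(w + u) - f(w - u)) / u
    return total * h

w = 0.7
real_from_imag = principal_value(chi_imag, w) / math.pi    # compare with chi'
imag_from_real = -principal_value(chi_real, w) / math.pi   # compare with chi''
print(real_from_imag, chi_real(w))
print(imag_from_real, chi_imag(w))
```

The slowly decaying tail of $\chi''$ is cut at a finite frequency, so the first comparison is accurate only to about one part in a thousand.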
6.3 Probabilistic formalism

In this Section we recall some features of the alternative approach to stochastic processes 
that consists in studying the evolution of the time-dependent probability distribution of 
the system's degrees of freedom, instead of writing a stochastic evolution equation for 
their dynamics. 

6.3.1 Time-dependent probabilities 

Let us consider a stochastic process characterized by a dynamic variable $x$. For
simplicity we use a notation in which we encode all dynamic variables in a single vectorial
degree of freedom $x = (x_1, x_2, \dots, x_d)$. Later we shall make explicit the case in which
one has position-like and momentum-like variables. The time-dependence of $x$ can be
dictated by a Langevin equation of the kind discussed in the previous Sections or it can
be determined by other stochastic dynamic rules. Since its evolution is not deterministic,
it can be known only in a probabilistic sense. The knowledge of the joint time-dependent
probability density

\[ P(x_n, t_n; x_{n-1}, t_{n-1}; \dots; x_0, t_0) , \]

characterizes the process completely (we use a discretized description of all times,
$t_0 < t_1 < \dots < t_n$, $t_n$ being the total time). In ?? we shall use a short-hand notation denoting
by $X$ the complete trajectory of the process, $X = (x_n, t_n; \dots; x_0, t_0)$.
The conditional probability density

\[ P(x_n, t_n; \dots; x_{k+1}, t_{k+1} \,|\, x_k, t_k; \dots; x_0, t_0) \equiv \frac{P(x_n, t_n; \dots; x_0, t_0)}{P(x_k, t_k; \dots; x_0, t_0)} \]

allows one to know the probability of finding future configurations given that the process
took some specified previous values.

One can distinguish between processes for which their full history is needed to predict 
their future (in a probabilistic sense) and processes for which the knowledge of the present 
is enough to do so. The former are called non-Markov while the latter are Markov. The 
standard example of a Markov chain is a random walk on a $d$-dimensional square lattice: at
each time-step the walker can jump to any of its $2d$ neighboring sites with equal probability
$1/(2d)$ that is totally independent of how the walker got to its present position. A self-
avoiding random walk on the same lattice is instead a non-Markovian stochastic process:
since the chain cannot cross on the lattice one needs to know the positions the walker
occupied at all previous times.$^6$ In more technical terms, a Markov process is such that

\[ P(x_n, t_n; \dots; x_{k+1}, t_{k+1} \,|\, x_k, t_k; \dots; x_0, t_0) = P(x_n, t_n; \dots; x_{k+1}, t_{k+1} \,|\, x_k, t_k) . \]

This property implies that any joint probability density can be expressed in terms of
conditional probabilities that depend only on the present and immediately subsequent
configurations, called transition probabilities,

\[ T(x_k, t_k | x_{k-1}, t_{k-1}) \equiv P(x_k, t_k | x_{k-1}, t_{k-1}) . \]

Hence,

\[ P(x_n, t_n; \dots; x_0, t_0) = T(x_n, t_n | x_{n-1}, t_{n-1}) \dots T(x_1, t_1 | x_0, t_0)\, P(x_0, t_0) . \]

Clearly, the transition probabilities satisfy

\[ \int dx_k\, T(x_k, t_k | x_{k-1}, t_{k-1}) = 1 , \qquad T(x_k, t_k | x_{k-1}, t_{k-1}) \geq 0 . \]

$^6$ Note however that we refer here to dynamically generated self-avoiding walks, which does not define
the same statistical ensemble as that relevant for polymers in thermal equilibrium. On this point, see [?].

Chapman-Kolmogorov integral equations. 

The probability density $P$ of a generic (and not necessarily Markov) stochastic process
evaluated at $t_2$ is related to the one at $t_1$ by the integral equation

\[ P(x_2, t_2) = \int dx_1\, P(x_2, t_2; x_1, t_1) = \int dx_1\, P(x_2, t_2 | x_1, t_1)\, P(x_1, t_1) \]

where the integration runs over all possible values of $x_1$. Note that $t_1$ and $t_2$ are not
necessarily infinitesimally close to each other.

The conditional probabilities satisfy a similar integral equation. Indeed,

\[ P(x_3, t_3; x_1, t_1) = P(x_3, t_3 | x_1, t_1)\, P(x_1, t_1) = \int dx_2\, P(x_3, t_3; x_2, t_2; x_1, t_1) \]
\[ \qquad = \int dx_2\, P(x_3, t_3 | x_2, t_2; x_1, t_1)\, P(x_2, t_2; x_1, t_1) \]
\[ \qquad = \int dx_2\, P(x_3, t_3 | x_2, t_2; x_1, t_1)\, P(x_2, t_2 | x_1, t_1)\, P(x_1, t_1) , \tag{6.2} \]

from where we get

\[ T(x_3, t_3 | x_1, t_1) = \int dx_2\, T(x_3, t_3 | x_2, t_2)\, T(x_2, t_2 | x_1, t_1) , \tag{6.3} \]

if the process is Markov. This is the Chapman-Kolmogorov equation that links the tran- 
sition probabilities. 
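For a discrete configuration space the integral over the intermediate configuration becomes a matrix product. The following sketch takes, as an assumed example, the unbiased random walk on a ring of six sites. The two-step transition probabilities obtained by composing one-step ones are compared with a direct enumeration of paths; the function names are illustrative.

```python
def one_step_matrix(n_sites):
    """T[j][i] = probability to jump from site i to site j in one step on a ring."""
    T = [[0.0] * n_sites for _ in range(n_sites)]
    for i in range(n_sites):
        T[(i + 1) % n_sites][i] += 0.5
        T[(i - 1) % n_sites][i] += 0.5
    return T

def compose(T_late, T_early):
    """Chapman-Kolmogorov: sum over the intermediate configuration."""
    n = len(T_late)
    return [[sum(T_late[k][m] * T_early[m][i] for m in range(n)) for i in range(n)]
            for k in range(n)]

def two_step_by_enumeration(n_sites):
    """Independent check: enumerate the four equally likely two-step paths."""
    P = [[0.0] * n_sites for _ in range(n_sites)]
    for i in range(n_sites):
        for s1 in (-1, 1):
            for s2 in (-1, 1):
                P[(i + s1 + s2) % n_sites][i] += 0.25
    return P

T1 = one_step_matrix(6)
T2 = compose(T1, T1)
T2_check = two_step_by_enumeration(6)
T3 = compose(T2, T1)
print(T2[0][0], T2[2][0])  # return probability and probability to move two sites
```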

Generation and elimination of memory. 

Two points about the differences between Markov and non-Markov processes are worth 
discussing before going on. 

First, the minimum time-separation, $\delta = t_{k+1} - t_k$, that is physically observable may
play a role in the classification of a process as Markov or non-Markov. In most realistic 
situations, if one investigated the dynamics with a sufficiently fine time grid, non-Markov 
effects would be observable. However, for many practical purposes one can assume that 
these fine details are overlooked by a sparse time-grid that is longer than the characteristic 
memory time. Thus, the processes can be considered to be effectively Markov, in the same 
way as the noise-noise correlations in the Langevin approach can usually be taken to be 
delta functions (in time). 

Besides, as we have already noticed when we rewrote the Langevin equation for the 
Brownian particle as a function of the coordinate only, the elimination of some degrees 
of freedom in the system, by integrating their dynamic equations and replacing the result 
in the remaining ones, may transform a Markov process into a non-Markov one. The
same can occur at the level of the probabilistic description that we are discussing in this
Section. 

Thus, we conclude that the Markov or non-Markov character of a process may depend 
on the level of description we want to obtain and on the approach we adopt. 

Stationarity and ergodicity. 

A stochastic process is stationary when the joint-probability distribution is invariant
under translations of time (TTI):

\[ P(x_n, t_n; \dots; x_0, t_0) = P(x_n, t_n + \Delta; \dots; x_0, t_0 + \Delta) . \tag{6.4} \]

In particular, this implies that the one-time probability is independent of time,
$P(x,t) = P_{st}(x)$, and the two-time joint probability depends on the time-difference only,
$P(x_1, t_1; x_2, t_2) = P_{st}(x_1, t_1 - t_2; x_2, 0)$. This immediately implies that the conditional
probabilities are also functions of the time-difference only:
$P(x_1, t_1 | x_2, t_2) = P_{st}(x_1, t_1 - t_2 | x_2, 0)$. For a Markov
process the latter properties are sufficient to determine the stationary character of the
process since all joint probabilities can be expressed in terms of one-time and transition
probabilities only.

Property (6.4) immediately implies that in a stationary process the correlation
between any number of observables, $A_1, \dots, A_n$, evaluated at different times $t_1 < \dots < t_n$,
is invariant under translations of time, irrespective of the values of the time differences
$t_2 - t_1, \dots, t_n - t_{n-1}$,

\[ C_{A_n A_{n-1} \dots A_1}(t_n + \Delta, \dots, t_1 + \Delta) = C_{A_n A_{n-1} \dots A_1}(t_n, \dots, t_1) . \]

In particular, one has $C_{AB}(t,t') = C^{st}_{AB}(t-t')$. Similarly, one proves that the multi-time
linear responses are also stationary when (6.4) holds.

An ergodic Markov chain is such that all possible configurations, i.e. all possible values
of $x$, are accessible from any initial condition.

6.4 Master equation 

Let us now focus on a Markov chain. The evolution of the conditional probability density
$P(x, t_k | x'', t_0)$ on the time sequence $t_0, \dots, t_n$, is determined by the master equation

\[ P(x, t_{k+1} | x'', t_0) - P(x, t_k | x'', t_0) = -\sum_{x'} T(x', t_{k+1} | x, t_k)\, P(x, t_k | x'', t_0) + \sum_{x'} T(x, t_{k+1} | x', t_k)\, P(x', t_k | x'', t_0) . \tag{6.5} \]

Sometimes, one simplifies the notation and writes $P(x,t)$ for the unknown in this equation,
with the initial condition $P(x, t_0)$ kept implicit. The LHS is the definition of the change
in the probability density between the subsequent times $t_k$ and $t_{k+1}$. The RHS has two
contributions: the first (negative) term represents the process of leaving the configuration
$x$, the second (positive) term represents the process of reaching the configuration $x$.
The master equation (6.5) can be written in a matricial form. Indeed, one recasts it as

\[ P(x, t_{k+1} | x'', t_0) = \sum_{x'} W(x, t_{k+1} | x', t_k)\, P(x', t_k | x'', t_0) \tag{6.6} \]

with the matrix $W$ written in terms of the transition probabilities, $T$.

Balance and detailed balance.

Any stationary solution to the master equation satisfies

\[ \sum_{x'} T(x', t_{k+1} | x, t_k)\, P_{st}(x) = \sum_{x'} T(x, t_{k+1} | x', t_k)\, P_{st}(x') \qquad \forall x . \tag{6.7} \]

This equation is called balance and it is a necessary condition to allow for a steady state.
The balance condition may admit many solutions. Detailed balance is a restatement of
the invariance of the dynamics under time-reversal in a probabilistic sense. It states that,
in the steady state, any transition is balanced by its time-reversed,

\[ T(x', \delta | x, 0)\, P_{st}(x) = T(x^R, \delta | x'^R, 0)\, P_{st}(x'^R) , \tag{6.8} \]

and the stationary measure is also invariant under time-reversal,

\[ P_{st}(x) = P_{st}(x^R) . \tag{6.9} \]

We classify the stochastic variables of a system depending on their parity properties
under time-reversal

\[ x_i^R(t) = x_i(-t) = \epsilon_i\, x_i(t) \quad \mbox{with} \quad \epsilon_i = \pm 1 , \]

for even and odd variables, respectively. One can check that detailed balance implies
balance.

When the number of possible 'microstates' x is finite, detailed balance is sufficient 
(though not necessary) to ensure that the stochastic process approaches its stationary 
distribution function for long times. 
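A minimal sketch of these statements for a finite number of microstates follows. The three energy levels and the Metropolis-like rule used to build the transition probabilities are assumptions made for this illustration (they are not taken from these notes), and all variables are taken to be even under time reversal, so that $x^R = x$. The code checks detailed balance with respect to the Boltzmann weight and iterates the master equation in matrix form until the distribution stops changing.

```python
import math

def metropolis_matrix(energies, T):
    """T_mat[j][i] = probability of the transition i -> j in one time step.
    Assumed rule: propose another state uniformly, accept with min(1, e^{-dE/T})."""
    n = len(energies)
    T_mat = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if j != i:
                accept = min(1.0, math.exp(-(energies[j] - energies[i]) / T))
                T_mat[j][i] = accept / (n - 1)
        T_mat[i][i] = 1.0 - sum(T_mat[j][i] for j in range(n) if j != i)
    return T_mat

def evolve(T_mat, p, n_steps):
    """Iterate the master equation in its matrix form, P(t_{k+1}) = W P(t_k)."""
    n = len(p)
    for _ in range(n_steps):
        p = [sum(T_mat[j][i] * p[i] for i in range(n)) for j in range(n)]
    return p

energies, temperature = [0.0, 0.5, 2.0], 1.0
weights = [math.exp(-e / temperature) for e in energies]
p_eq = [w / sum(weights) for w in weights]
T_mat = metropolis_matrix(energies, temperature)
p_long = evolve(T_mat, [1.0, 0.0, 0.0], 200)
print(p_long, p_eq)
```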

As an example let us consider a particle system characterized by the positions and
momenta of each particle. A transition corresponds to modifying the position and
momentum of one particle from $(\vec r, \vec v)$ to $(\vec r', \vec v')$. Since the backwards motion from $\vec r'$ to $\vec r$
occurs in the opposite direction, the reversed transition corresponds to modifying $(\vec r', -\vec v')$
into $(\vec r, -\vec v)$. Detailed balance requires that these two processes be equiprobable when the
system attained its stationary state, i.e.

\[ P_{st}(\vec r', \vec v', \delta; \vec r, \vec v, 0) = P_{st}(\vec r, -\vec v, \delta; \vec r', -\vec v', 0) , \]
\[ T(\vec r', \vec v', \delta | \vec r, \vec v, 0)\, P_{st}(\vec r, \vec v) = T(\vec r, -\vec v, \delta | \vec r', -\vec v', 0)\, P_{st}(\vec r', -\vec v') . \tag{6.10} \]

Note that these conditions ensure that the master equation admits a stationary
solution. We still do not know if this solution is the only one or whether the process converges,
asymptotically, to it. We shall discuss these two questions in Sect. 6.5. Furthermore, we
have to determine, in as much generality as possible, when the stationary solution
coincides with the equilibrium measure, $P_{st}(x) = P_{eq}(x)$. We shall come back to these very
important issues later. Finally, we have not discussed here the effect of external fields on
the time-reversal properties of a stochastic process. As we discuss in ??, the external
fields should also be time-reversed in the RHS of the generalized (6.8).



6.5 Fokker-Planck equations



The integral Chapman-Kolmogorov equation can be transformed into a differential equa- 
tion under certain assumptions. Several textbooks describe this derivation [22]. Here we 
summarize some of the forms that this equation can take. 

Kramers-Moyal expansion. 

Starting from the Chapman-Kolmogorov equation, Kramers and Moyal derived a
general differential equation for the conditional probabilities. This equation involves a series
in which the coefficients are related to the moments of the conditional probability. For
non-Markov systems these coefficients depend on the full history of the process. For
Markov processes the differential equation is local in time but it might involve an infinite
number of terms. When the Kramers-Moyal expansion stops after the second term, as
happens for Langevin processes with additive noise, we are left with a Fokker-Planck
equation (see also below). In the case where the process does not 'jump', i.e. all displacements
become small in the limit $\delta \to 0$, one can write [22]:

\[ \frac{\partial P(x,t|x',t')}{\partial t} = -\frac{\partial}{\partial x_i} \left[ D^{(1)}_i(x,t)\, P(x,t|x',t') \right] + \frac{\partial^2}{\partial x_i \partial x_j} \left[ D^{(2)}_{ij}(x,t)\, P(x,t|x',t') \right] . \tag{6.11} \]

The drift $D^{(1)}_i(x,t)$ and the diffusion constants $D^{(2)}_{ij}(x,t)$ are given by:

\[ D^{(1)}_i(x,t) = \lim_{\delta\to 0} \frac{1}{\delta} \int dx'\, (x'-x)_i\, T(x', t_{k+1} | x, t_k) , \]
\[ D^{(2)}_{ij}(x,t) = \lim_{\delta\to 0} \frac{1}{2\delta} \int dx'\, (x'-x)_i (x'-x)_j\, T(x', t_{k+1} | x, t_k) . \tag{6.12} \]
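The definitions of the first two coefficients, taken here as the mean displacement per unit time and half the mean-squared displacement per unit time, suggest a direct way of estimating them from data. The sketch below does so for an assumed overdamped Langevin process, $\gamma\dot x = -kx + \xi$ with $\langle\xi(t)\xi(t')\rangle = 2\gamma T\delta(t-t')$, by sampling many displacements over one short time step $\delta$ from a fixed starting point $x_0$. One expects the drift $-kx_0/\gamma$ and the diffusion constant $T/\gamma$, up to corrections of order $\delta$ and sampling noise. All names and parameter values are illustrative.

```python
import math
import random

def kramers_moyal_estimates(x0=1.0, k=1.0, gamma=1.0, T=0.5, delta=0.01,
                            n_samples=100_000, seed=4):
    """Estimate D1 and D2 at x0 from displacements over one short time step delta
    of the assumed overdamped process gamma*dx/dt = -k*x + xi."""
    rng = random.Random(seed)
    amp = math.sqrt(2.0 * T * delta / gamma)
    m1 = m2 = 0.0
    for _ in range(n_samples):
        dx = -(k / gamma) * x0 * delta + amp * rng.gauss(0.0, 1.0)
        m1 += dx
        m2 += dx * dx
    d1 = m1 / (n_samples * delta)          # (1/delta) <x' - x>
    d2 = m2 / (2.0 * n_samples * delta)    # (1/(2 delta)) <(x' - x)^2>
    return d1, d2

d1_est, d2_est = kramers_moyal_estimates()
print(d1_est, d2_est)  # to be compared with -k*x0/gamma = -1 and T/gamma = 0.5
```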



The names given to these coefficients reflect their underlying physical nature, as will be
clear below. The above Fokker-Planck equation describes a Brownian motion process with
local and time dependent drift and diffusion constants. There is however a much wider
class of processes that admit a continuous time limit, although they lose the property of
being continuous (in $x$). Suppose that the process can make jumps of non vanishing
amplitude, but more and more rarely as $\delta \to 0$, such that the contributions to the transition
probabilities $T(x', t_{k+1} | x, t_k)$ corresponding to these jumps are proportional to $\delta$. In this
case, the Kramers-Moyal expansion must contain non local spatial terms, corresponding
to these jumps, and reads

\[ \frac{\partial P(x,t|x',t')}{\partial t} = -\frac{\partial}{\partial x_i} \left[ D^{(1)}_i(x,t)\, P(x,t|x',t') \right] + \frac{\partial^2}{\partial x_i \partial x_j} \left[ D^{(2)}_{ij}(x,t)\, P(x,t|x',t') \right] \]
\[ \qquad + \lim_{\delta\to 0} \int dx'' \left[ \frac{\tilde T(x, t+\delta | x'', t)}{\delta}\, P(x'',t|x',t') - \frac{\tilde T(x'', t+\delta | x, t)}{\delta}\, P(x,t|x',t') \right] , \tag{6.13} \]

where the last term describes the jumps in the trajectories of the stochastic process,
and $\tilde T$ the corresponding transition probabilities. That the above decomposition between
a (continuous) Brownian diffusion component and a jump component is unique in the
continuous time limit is the content of the work of P. Levy on infinitely divisible processes,
and is called the Levy decomposition. In some cases, the jump component admits a
fractional derivative representation, generalizing the second order diffusion term. This
corresponds to what is known as Levy flight processes [25].

When the jump processes vanish, the Fokker-Planck equation can be recast in the form
of a local conservation law. Indeed, defining the probability current

\[ J_i(x,t|x',t') = D^{(1)}_i(x,t)\, P(x,t|x',t') - \sum_j \frac{\partial}{\partial x_j} \left[ D^{(2)}_{ij}(x,t)\, P(x,t|x',t') \right] , \]

it becomes

\[ \frac{\partial P(x,t|x',t')}{\partial t} = -\sum_i \frac{\partial J_i(x,t|x',t')}{\partial x_i} . \]

Two special cases: Kramers and Smoluchowski. 

The Kramers and Smoluchowski equations are particular cases that hold exactly for
a linear Langevin process with white additive noise. The probability distribution of the
thermal noise, $P(\xi)$, induces a time-dependent probability distribution of the dynamic
variables $x$ and $v$:

\[ P(x,v,t) = \int \mathcal{D}\xi\, P(\xi)\, \delta(x - x_\xi(t))\, \delta(v - v_\xi(t)) , \tag{6.14} \]

where we adopted the short-hand notation $P(x,v,t)$ for the conditional probability
$P(x,v,t|x',v',t')$.

In the overdamped limit, the velocity $v$ is slaved to the local force and $P(x,v,t)$ is
replaced by an exclusive function of $x$:

\[ P(x,t) = \int \mathcal{D}\xi\, P(\xi)\, \delta(x - x_\xi(t)) , \tag{6.15} \]

that is determined by the following Fokker-Planck equation (also called Smoluchowski
equation in this context):

\[ \gamma\, \frac{\partial P(x,t)}{\partial t} = \frac{\partial}{\partial x_i} \left[ P(x,t)\, \frac{\partial V(x)}{\partial x_i} + T\, \frac{\partial P(x,t)}{\partial x_i} \right] , \tag{6.16} \]

with the initial condition $P(x_0, t_0)$. [Note that this equation actually applies to the
conditional probability $P(x,t|x_0,t_0)$.] See 4 for a rather rigorous proof of this equation.
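A quick numerical sanity check of the Smoluchowski equation is to verify that the Boltzmann weight $e^{-V/T}$ is stationary, i.e. that it makes the right-hand side $\gamma^{-1}\partial_x[P\,\partial_x V + T\,\partial_x P]$ vanish. The sketch below does this by finite differences in one dimension. The double-well potential, the temperature and the comparison density are arbitrary choices made for illustration.

```python
import math

GAMMA, TEMP = 1.0, 0.3  # assumed friction and temperature (k_B = 1)

def potential(x):
    return (x * x - 1.0) ** 2               # assumed double-well potential

def boltzmann(x):
    return math.exp(-potential(x) / TEMP)   # unnormalised weight exp(-V/T)

def gaussian(x):
    return math.exp(-x * x)                 # a non-stationary density, for contrast

def smoluchowski_rhs(p, x, h=1e-4):
    """(1/gamma) d/dx [ p dV/dx + T dp/dx ] by centred finite differences."""
    def flux(y):
        dV = (potential(y + h) - potential(y - h)) / (2.0 * h)
        dp = (p(y + h) - p(y - h)) / (2.0 * h)
        return p(y) * dV + TEMP * dp
    return (flux(x + h) - flux(x - h)) / (2.0 * h * GAMMA)

grid = [-1.5 + 0.1 * i for i in range(31)]
max_rhs_boltzmann = max(abs(smoluchowski_rhs(boltzmann, x)) for x in grid)
max_rhs_gaussian = max(abs(smoluchowski_rhs(gaussian, x)) for x in grid)
print(max_rhs_boltzmann, max_rhs_gaussian)
```

The first number is zero up to discretization error, while the second is of order one: a generic density is not stationary.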

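In the case where inertia cannot be neglected, and noise is additive, one can establish
that the probability density $P(x,v,t)$ satisfies a differential equation that is first order in
time, the Kramers equation:

\[ \frac{\partial P(x,v,t)}{\partial t} = -\frac{\partial}{\partial x_i} \Big( v_i\, P(x,v,t) \Big) + \frac{1}{m} \frac{\partial}{\partial v_i} \left[ \left( \gamma v_i + \frac{\partial V(x)}{\partial x_i} + \frac{\gamma T}{m} \frac{\partial}{\partial v_i} \right) P(x,v,t) \right] , \tag{6.17} \]

with the initial condition $P(x_0, v_0, t_0)$.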

It is very important to note that the balancing of factors on the RHS of the Kramers
and Smoluchowski equations is a direct consequence of the equilibration of the noise (see
Sect. 6.1.2) when the equations derive from a Langevin process. It is totally equivalent to
the relation (6.10) between the strength of the noise-noise correlator and the friction
coefficient. More generally, it is a particular case of the detailed-balance condition (6.8), that
calling $H_{fp}(x)$ the operator acting on $P(x,t)$ in the RHS of the Fokker-Planck equation,
reads

\[ H_{fp}(x)\, P_{st}(x) = P_{st}(\{\epsilon_i x_i\})\, H^\dagger_{fp}(\{\epsilon_i x_i\}) . \tag{6.18} \]

The Fokker-Planck equation for a stochastic field. 

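A Fokker-Planck equation can also be deduced for a stochastic variable that is actually
a $d$-dimensional field, $\vec\phi(\vec x, t) = (\phi_1(\vec x,t), \dots, \phi_d(\vec x,t))$. It is the natural generalization of the
equations presented in the previous section. In the Smoluchowski limit, and for additive
noise, it reads

\[ \gamma\, \frac{\partial P(\vec\phi, t)}{\partial t} = \int d^dx\; \frac{\delta}{\delta\phi_a(\vec x,t)} \left[ \frac{\delta V}{\delta\phi_a(\vec x,t)}\, P(\vec\phi,t) + T\, \frac{\delta}{\delta\phi_a(\vec x,t)}\, P(\vec\phi,t) \right] . \]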

Approach to stationarity. 

Under rather mild conditions one can prove that if a stochastic process governed by 
a generic master equation admits a stationary state asymptotically, this one is unique. 
We present a proof of this statement below. A very simple exception to this rule is given 
by "decomposable" systems made of two or more non-interacting systems in which case 
one can construct several stationary asymptotic solutions given by linear superpositions 
with arbitrary coefficients of the stationary solution for each subsystem. Another case of 
exceptions is given by certain problems with a continuous set of possible states. We shall 
not discuss these special cases here but focus on the more generic situation. 

Possibly, the simplest way to test under which conditions the time-dependent solution
to the Fokker-Planck equation approaches a stationary form asymptotically is to use the
"H-theorem" that we discuss, with other purposes, in Comp. ?? [?, ?, ?]. The H-functional
or Lyapunov-functional

\[ H_{P_1,P_2}(t) \equiv \int dx \; C\!\left( \frac{P_1(x,t)}{P_2(x,t)} \right) P_2(x,t) , \tag{6.19} \]

measures a "distance" between the (normalised) pdfs, $P_1$ and $P_2$. $C(y)$ is any strictly
convex function, i.e., it satisfies $C(\sum_i \omega_i y_i) < \sum_i \omega_i C(y_i)$ for
$\sum_i \omega_i = 1$, $\omega_i > 0$ and $y_i$ not all equal.
Customarily one uses $C(x) = x \ln x$, which suggests relating $H$ to a non-equilibrium
extension of the entropy concept via $S(t) = S_0 - H(t)$ with $S_0$ the thermodynamic entropy.

Now, $H_{P_1,P_2}(t)$ is bounded from below and it is a monotonically decreasing function of
time when (a) the diffusion matrix $D^{(2)}_{ij}$ is positive definite (see footnote 7), (b) the
drift matrix does not have singularities (that correspond to infinitely high barriers that render
the problem decomposable) and (c) $P_1$ and $P_2$ are different from zero away from infinity.
Thus, the distance between any two normalized solutions vanishes asymptotically.
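The monotonic decay of the H-functional can be illustrated numerically. The sketch below is only an analogy: it uses a discrete-state master equation instead of the Fokker-Planck equation, with a hypothetical three-state rate matrix `RATES`, hypothetical initial pdfs, and the customary choice $C(y) = y \ln y$. None of these names or numbers come from the text.

```python
import math

# Hypothetical 3-state master equation: RATES[i][j] is the jump rate j -> i.
RATES = [[0.0, 1.0, 0.5],
         [2.0, 0.0, 1.5],
         [0.3, 0.7, 0.0]]

def euler_step(p, dt):
    """One explicit Euler step of dp_i/dt = sum_j (W_ij p_j - W_ji p_i)."""
    n = len(p)
    new = list(p)
    for i in range(n):
        for j in range(n):
            if i != j:
                new[i] += dt * (RATES[i][j] * p[j] - RATES[j][i] * p[i])
    return new

def h_functional(p1, p2):
    """H = sum_x P2(x) C(P1(x)/P2(x)) with the customary C(y) = y ln y."""
    return sum(a * math.log(a / b) for a, b in zip(p1, p2) if a > 0.0)

def h_history(p1, p2, dt=0.01, steps=300):
    """Evolve both pdfs with the same dynamics and record H at every step."""
    values = []
    for _ in range(steps):
        values.append(h_functional(p1, p2))
        p1, p2 = euler_step(p1, dt), euler_step(p2, dt)
    return values

history = h_history([0.90, 0.05, 0.05], [0.10, 0.20, 0.70])
print(history[0], history[-1])  # H starts of order one and decays towards zero
```

Both pdfs are propagated with the same rates, so the decay of `history` reflects the two solutions approaching each other, not either of them being stationary from the start.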

7 The argument needs to be slightly modified for the Kramers equation given that its diffusion matrix
is not positive definite; still, one also proves that the asymptotic solution is unique in this case [?].

When the drift and diffusion matrices do not depend on time, the Fokker-Planck equation
may admit a stationary solution. Based on the above argument, any other normalized
solution necessarily approaches the stationary one in the long-time limit. Two types of
stationary solution can be identified. The simpler ones have constant current $J_i(x) = J_i$
fixed by the boundary conditions. Natural boundary conditions demand $J_j = 0$ when
$x_j \to \infty$ for all $j$; thus, $J_i = 0$ everywhere in space, in which case the stationary
solution takes the potential form

\[ P_{st}(x) \propto e^{-\phi(x)} , \qquad \phi(x) = - \int^x dx'_k \, [D^{(2)}(x')]^{-1}_{ki} \left( D^{(1)}_i(x') - \frac{\partial D^{(2)}_{ij}(x')}{\partial x'_j} \right) , \]

where a sum over repeated indices is assumed. In the simple case $D^{(1)}_i(x) = -\partial_{x_i} V(x)$
and $D^{(2)}_{ij}(x) = T \delta_{ij}$ one easily checks that this expression becomes the Boltzmann factor.
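The approach to the Boltzmann factor in the potential case can be checked with a small numerical experiment. The sketch below makes several assumptions that are not in the text: a one-dimensional harmonic potential $V(x) = k x^2/2$, a lattice master equation with nearest-neighbour hops whose continuum limit is the Smoluchowski equation, reflecting walls at the ends of the grid, and an explicit Euler time step. The function name and all parameter values are hypothetical.

```python
import math

def relax_on_lattice(T=1.0, gamma=1.0, k=1.0, dx=0.2, half_width=4.0,
                     dt=0.005, t_max=20.0):
    """Evolve a lattice master equation whose continuum limit is the
    Smoluchowski equation with V(x) = k x^2 / 2; return (p, boltzmann)."""
    n = int(round(2 * half_width / dx)) + 1
    xs = [-half_width + i * dx for i in range(n)]
    V = [0.5 * k * x * x for x in xs]
    base = T / (gamma * dx * dx)
    # Hopping rates between neighbours i and i+1; their ratio is
    # exp(-(V[i+1] - V[i]) / T), so detailed balance holds on the lattice.
    right = [base * math.exp(-(V[i + 1] - V[i]) / (2 * T)) for i in range(n - 1)]
    left = [base * math.exp(-(V[i] - V[i + 1]) / (2 * T)) for i in range(n - 1)]
    p = [0.0] * n
    p[(3 * n) // 4] = 1.0          # start far from equilibrium, near x = 2
    for _ in range(int(round(t_max / dt))):
        new = list(p)
        for i in range(n - 1):
            flow = dt * (right[i] * p[i] - left[i] * p[i + 1])
            new[i] -= flow
            new[i + 1] += flow
        p = new
    z = sum(math.exp(-v / T) for v in V)
    boltzmann = [math.exp(-v / T) / z for v in V]
    return p, boltzmann

p, boltzmann = relax_on_lattice()
print(max(abs(a - b) for a, b in zip(p, boltzmann)))  # small residual
```

The hop rates are chosen symmetrically in the energy difference so that the lattice stationary state is exactly the discretised Boltzmann weight; other discretisations would reproduce it only up to corrections in `dx`.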



The question then arises as to whether an explicit solution is possible when the drift term 
is not of a potential form. 

Another family of solutions corresponds to non-equilibrium steady states (NESS) with a
divergenceless, time-independent current that is not necessarily constant in space. When
the drift term itself is divergenceless and $D^{(2)}_{ij}$ is constant - corresponding for example
to tracer particles advected by an incompressible convection flow and subject to molecular
diffusion - then the NESS is trivial and is given by $P_{st}(x) = 1/V$, where $V$ is the total
volume accessible to the particle. The local current is then given by $J_i = D^{(1)}_i(x)/V$ and is
by construction divergenceless. Another case that can be solved in full generality is when
the drift term is the sum of a potential part and a divergenceless part that are everywhere
orthogonal to each other. Then, the divergenceless part simply advects the particles along
equi-potential lines and does not modify the standard equilibrium Boltzmann factor. In
the completely general case of an arbitrary drift and diffusion structure, there is in
general no explicit construction of the NESS (for examples where this construction is
possible, see Haake, Derrida et al., Jona Lasinio).

Approach to equilibrium. 



Whether a stochastic process approaches equilibrium asymptotically depends on the
nature of the forces applied, the boundary conditions, etc. In the following we shall
focus on Fokker-Planck processes with no jumps. Among these, a subclass admits a
spectral representation of the Fokker-Planck operator with generic complex eigenvalues. The
asymptotic analysis has to be performed on a case-by-case basis.

A more restrictive class of systems is represented by self-adjoint FP operators. The
potential case, $D^{(2)}_{ij} = T \delta_{ij}$ and $D^{(1)}_i = -\partial_{x_i} V$, is a special problem
in this class and one proves that the process does indeed approach a stationary solution that
is given by the canonical equilibrium measure.

An easy and elegant proof of this statement, for a Smoluchowski potential problem, relies
on a mapping of the Fokker-Planck equation to the Schrödinger equation [?]. Introducing

\[ P(x,t) = c \, e^{-\frac{\beta}{2} V(x)} \, p(x,t) \tag{6.20} \]

with $c$ a positive constant, the Fokker-Planck equation becomes

\[
\frac{\partial p(x,t)}{\partial t} = \frac{1}{\gamma} \left[ T \frac{\partial^2}{\partial x_i^2} - \left( \frac{\beta}{4} \left( \frac{\partial V(x)}{\partial x_i} \right)^2 - \frac{1}{2} \frac{\partial^2 V(x)}{\partial x_i^2} \right) \right] p(x,t) \equiv - L_s \, p(x,t) .
\tag{6.21}
\]

This is a Schrödinger equation in imaginary time. The potential energy term, say $V_s$, is a
function of the original potential $V$. $p(x,t)$ is related to the probability density and plays
the role of a wave function, while in true quantum mechanics it is the modulus squared
of the wave function which has a probability interpretation. The operator $L_s$ admits a
spectral representation in terms of its eigenvalues $\lambda_n$ and its eigenvectors $\psi_n$. Since the
potential $V_s$ is independent of time, the eigenvectors are time-independent, $\psi_n(x)$. If
the potential $V_s$ grows to infinity sufficiently fast when $x_i \to \pm\infty$, for all $i$, the spectrum is
discrete and there is a gap between the lowest eigenvalue, $\lambda_0$, and the first excited state, $\lambda_1$.
The ground state wave function is everywhere positive and can hence describe a probability.
One can also show that the operator $L_s$ can be written as the square of a certain operator,
and is thus positive semi-definite in a Hilbert space (see footnote 8), and the eigenvalues,
$\lambda_n$, are real and satisfy $\lambda_n \geq 0$.
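As a worked example, take a one-dimensional harmonic potential $V(x) = k x^2/2$ (an assumption made here for illustration; it is not treated in the text) and the Smoluchowski equation $\gamma \partial_t P = \partial_x (V' P) + T \partial_x^2 P$. The substitution (6.20) then leads to an exactly solvable spectrum:

```latex
\begin{align*}
V_s(x) &= \frac{1}{4T} \, V'(x)^2 - \frac{1}{2} \, V''(x) = \frac{k^2 x^2}{4T} - \frac{k}{2} , \\
\gamma L_s &= - T \partial_x^2 + \frac{k^2 x^2}{4T} - \frac{k}{2} , \\
\lambda_n &= \frac{n k}{\gamma} , \qquad n = 0, 1, 2, \dots , \\
\psi_0(x) &= \left( \frac{k}{2 \pi T} \right)^{1/4} e^{-k x^2/(4T)} , \qquad \psi_0^2(x) = P_{eq}(x) .
\end{align*}
```

The operator $\gamma L_s$ is a quantum harmonic oscillator Hamiltonian shifted by its zero-point energy, so the lowest eigenvalue vanishes, the gap is $\lambda_1 = k/\gamma$, and the equilibration time in this example is $\gamma/k$.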

The solution to eqn (6.21) can be expressed as the series

\[ p(x,t) = \sum_n c_n \psi_n(x) \, e^{-\lambda_n t} \tag{6.22} \]

with $c_n$ arbitrary numerical constants. When a stationary solution exists, the operator
has a vanishing eigenvalue, $\lambda_0 = 0$, that, if the conditions mentioned in the previous
subsection are satisfied, is not degenerate. One has $p_{st}(x) = c_0 \psi_0(x)$. The eigenstates
associated to non-zero eigenvalues can be degenerate, in which case one needs a more
refined notation to distinguish their associated eigenvectors. We skip this detail and
we continue to use the simplified notation above. If there is a gap in the spectrum,
$\lambda_1 > 0$, in the long-time limit only the contribution of the zero eigenvalue survives and
$p(x,t) \to p_{st}(x) = c_0 \psi_0(x)$. The constant $c_0$ is fixed to one by the normalization of the
probability. Indeed, since $\psi_0(x) = c \, e^{-\frac{\beta}{2} V(x)}$, one has
$c_0 = \int dx \, \psi_0(x) p(x,0) = \int dx \, P(x,0) = 1$.

Going back to the original PDF, $P(x,t)$, one has

\[ \lim_{t\to\infty} P(x,t) = c \, e^{-\frac{\beta}{2} V(x)} \psi_0(x) = c^2 e^{-\beta V(x)} = \frac{e^{-\beta V(x)}}{\int dx' \, e^{-\beta V(x')}} = P_{eq}(x) , \]

where we used the conservation of probability to compute $c^{-2}$. Thus $P_{eq}$ is indeed the
asymptotic solution to the Fokker-Planck equation.

Note that this argument assumes that a sufficiently long time $t$ ($t > t_{eq}$) is reached such that
only the $\lambda_0 = 0$ term survives in the sum. This hypothesis does not hold in the asymptotic
analysis for the relaxing models we analyze in the next Sections. If the next eigenvalue $\lambda_1$
does not vanish, its inverse is the time-scale needed to equilibrate the model. If, however,
there is no gap in the spectrum, one does not have a simple argument to estimate how
long one should wait until the asymptotic limit is reached. This is a question that will
be raised regularly in the treatment of glassy dynamics. Moreover, when non-potential or
time-dependent forces are exerted on the system the transformation (6.20) is not sufficient
to deal with their effect and equilibrium cannot be established.

Since the eigenvectors corresponding to excited states are not everywhere positive, one
cannot interpret $\psi_0 \psi_n$ directly as a probability. Interestingly, however, one can construct
linear combinations of these that can be interpreted as metastable states, defined as
states with lifetimes longer than a certain fixed but long time-scale.

8 This space is defined via the scalar product $(f, g) = \int dx \, f(x) g(x) / P_{st}(x)$ and an operator $L$ is
self-adjoint if $(f, L g) = (L f, g)$.

6.6 Quantum mechanical formulation

Kadanoff and Swift proposed to use a quantum mechanical notation to represent
Fokker-Planck equations. Note that this is not identical to the above mapping to a Schrödinger
equation since the Fokker-Planck Hamiltonian is non-Hermitian. This notation is very
useful to prove simple properties of stochastic processes such as the ones holding near
equilibrium that we discuss in Sect. ??. Indeed, identifying $-i \partial/\partial x_i$ with the operator
$\hat p_i$, the usual commutation relations between momenta and coordinates are recovered:

\[ [\hat p_i, \hat x_j] = - i \delta_{ij} , \]

while $[\hat x_i, \hat x_j] = 0$ and $[\hat p_i, \hat p_j] = 0$, for all $i, j$. The probability distribution
$P(x,t)$ is then identified with a quantum time-dependent "state" $|P(t)\rangle$ and the
Fokker-Planck equation reads

\[ \frac{\partial |P(t)\rangle}{\partial t} = \hat H_{fp} |P(t)\rangle , \]

where $\hat H_{fp}$ is written in terms of the canonically conjugated operators $\hat x$ and $\hat p$, and the
transition probability is

\[ T(x, t+\tau | x', t) = \langle x | e^{\hat H_{fp} \tau} | x' \rangle . \]

The detailed balance property (6.18) implies that

\[ e^{\beta V(x)} H_{fp}(x) \, e^{-\beta V(x)} = H^\dagger_{fp}(\{\epsilon_i x_i\}) . \tag{6.23} \]

The form of the Fokker-Planck operator depends on the stochastic process considered.
In the case of a stochastic process of Kramers' type, describing a one dimensional system
with position $x$ and velocity $v$, one works with two "position operators", $(\hat x, \hat v)$, and
two "momentum operators", $(\hat p_x, \hat p_v)$, and the Hamiltonian reads

\[ \hat H_K = - i \hat p_x \hat v + \frac{1}{m} \, i \hat p_v V'(\hat x) + i \hat p_v \frac{\gamma}{m} \hat v - \frac{\gamma T}{m^2} \, \hat p_v^2 . \]

For the Smoluchowski equation one has one pair of conjugated variables, $(\hat x, \hat p)$, and

\[ \hat H_S = \frac{1}{\gamma} \, \hat p \left( i V'(\hat x) - T \hat p \right) . \]
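A short consistency check of the Smoluchowski Hamiltonian follows. It assumes the one-dimensional equation in the form $\gamma \partial_t P = \partial_x [V'(x) P] + T \partial_x^2 P$ and the sign convention $\partial_t P = \hat H_S P$; both are assumptions about conventions, and the opposite sign convention would simply flip the sign of $\hat H_S$. Acting on $P$ with $\hat p = -i \partial_x$:

```latex
\begin{align*}
\hat H_S P &= \frac{1}{\gamma} \, \hat p \left( i V' - T \hat p \right) P
            = \frac{1}{\gamma} \, (-i \partial_x) \left( i V' P + i T \partial_x P \right) \\
           &= \frac{1}{\gamma} \, \partial_x \left( V' P + T \partial_x P \right) ,
\end{align*}
```

which is the right-hand side of the Smoluchowski equation divided by $\gamma$.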

6.7 The fluctuation-dissipation theorem

The fluctuation-dissipation theorem (FDT) relates the correlations of spontaneous
equilibrium fluctuations to the response induced by an external field. It is a model independent
relation between the linear response and its associated correlation function that takes
somewhat different forms for classical and quantum systems. The latter reduces to the
former when quantum fluctuations become irrelevant. In this Section we present several
proofs of the FDT. When the system is out of equilibrium, on the other hand,
this relation in general does not hold, although a generalized version of the FDT can in
some cases be established.

6.7.1 Static FDT

Many relations between correlations of fluctuations and susceptibilities are known in
statistical mechanics. All these are different statements of the static FDT.

Take for instance a perfect gas. The fluctuations in the density $\rho = n/V$, where $n$ is
the number of particles within a sub-volume $V$ of a system with $N$ particles and total volume
$\mathcal{V}$, are defined as $\sigma^2 = \langle (\rho - \langle \rho \rangle)^2 \rangle$. In the thermodynamic limit
$N \to \infty$, $\mathcal{V} \to \infty$ with $N/\mathcal{V} = \rho$ fixed, these are related to the isothermal
compressibility $\chi_T = - \mathcal{V}^{-1} \, \partial \mathcal{V} / \partial P |_T$
via $\sigma^2 = T \rho^2 \chi_T / V$. This relation is a form of FDT.
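For the perfect gas this relation can be checked by hand. The sketch below uses units with $k_B = 1$ and the ideal-gas equation of state, which is an input not spelled out above; $\mathcal{V}$ denotes the total volume and $V$ the sub-volume.

```latex
\begin{align*}
P \mathcal{V} = N T \quad &\Rightarrow \quad
\chi_T = - \frac{1}{\mathcal{V}} \left. \frac{\partial \mathcal{V}}{\partial P} \right|_T = \frac{1}{P} = \frac{1}{\rho T} , \\
\sigma^2 = \frac{T \rho^2 \chi_T}{V} = \frac{\rho}{V} \quad &\Rightarrow \quad
\langle (n - \langle n \rangle)^2 \rangle = V^2 \sigma^2 = \rho V = \langle n \rangle ,
\end{align*}
```

that is, the number of particles in the sub-volume has Poissonian fluctuations, with variance equal to its mean.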

For a system in equilibrium with a thermal reservoir at temperature $T$ one has

\[ \chi \equiv \left. \frac{\delta \langle A \rangle_h}{\delta h} \right|_{h=0} = \beta \langle (A - \langle A \rangle)^2 \rangle \tag{6.24} \]

for any observable $A$. The average $\langle \cdots \rangle_h$ is calculated with the partition function of the
system in the presence of a small field coupled to $A$ in such a way that the Hamiltonian reads
$H = H_0 - hA$. For a magnetic system this equation relates the magnetic susceptibility to
the magnetization fluctuations.
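Relation (6.24) can be verified by exact enumeration in a small system. The sketch below assumes a ring of six Ising spins with nearest-neighbour coupling `J` and takes $A$ to be the total magnetization; the model, its size and the parameter values are illustrative choices, not taken from the text. The response is obtained from a centred finite difference in $h$.

```python
import itertools
import math

def averages(beta, h, n=6, J=1.0):
    """Return (<M>, <M^2>) for an Ising ring with H = -J sum s_i s_{i+1} - h M."""
    z = m1 = m2 = 0.0
    for spins in itertools.product((-1, 1), repeat=n):
        e0 = -J * sum(spins[i] * spins[(i + 1) % n] for i in range(n))
        m = sum(spins)
        w = math.exp(-beta * (e0 - h * m))   # Boltzmann weight with field
        z += w
        m1 += w * m
        m2 += w * m * m
    return m1 / z, m2 / z

beta, eps = 0.7, 1e-5
m_plus, _ = averages(beta, +eps)
m_minus, _ = averages(beta, -eps)
chi_response = (m_plus - m_minus) / (2 * eps)   # d<M>/dh at h = 0
m0, m2 = averages(beta, 0.0)
chi_fluct = beta * (m2 - m0 * m0)               # beta <(M - <M>)^2>
print(chi_response, chi_fluct)                  # the two estimates should agree
```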

When the order parameter is a field, for instance a scalar one, described by a Hamiltonian
or a Landau-type free-energy that is linearly modified by an external perturbation,
$F[\phi] \to F[\phi] - \int d^dx \, h(x) \phi(x)$, the static FDT (6.24) gets generalized to

\[ \left. \frac{\delta \langle \phi(x) \rangle_h}{\delta h(x')} \right|_{h=0} = \beta \langle \phi(x) \phi(x') \rangle_c = \beta C(x,x') . \]



6.7.2 Dynamic FDT 

There are several proofs of this theorem. We present two of them that are particularly
instructive since they use, very explicitly, the two necessary hypotheses, which are
(i) detailed balance and (ii) equilibration.

FDT from the master equation. 

For the sake of simplicity, let us consider a system with a single dynamical variable
$x$ that evolves according to a Markov process on a discrete time-grid $t_k = k\delta$. In what
follows we simplify the notation and we eliminate the time-dependence from the
transition probabilities, $T_h(x_k|x_{k-1}) = T_h(x_k,t_k|x_{k-1},t_{k-1})$. The subindex $h$ indicates that the
perturbation is present during the time-step $t_{k-1} \to t_k$. If the system reached equilibrium
at time $t_{k-1}$ one has

\[ \sum_{x_0, \dots, x_{k-2}} \; \prod_{l=1}^{k-1} T(x_l|x_{l-1}) \, P(x_0,t_0) = P_{eq}(x_{k-1}) . \tag{6.25} \]

We will use a superscript $h$ whenever equilibration has been achieved in the presence of
a field, $P^h_{eq}(x_{k-1})$.

The correlation between two observables $A(x)$ and $B(x)$ in the absence of any
perturbation is given by

\[ \langle A(x_j) B(x_k) \rangle = \sum_{x_{k-1}, \dots, x_j} A(x_j) \, T(x_j|x_{j-1}) \cdots T(x_{k+1}|x_k) \, B(x_k) \, T(x_k|x_{k-1}) \, P_{eq}(x_{k-1}) . \]

The variation with respect to $h$ of the mean value of $A(x)$ evaluated at time $t_j$ after a
step-like perturbation of strength $h$ that modified the energy as $H \to H - hB(x)$ between
$t_{k-1}$ and $t_k$ is

\[ \left. \frac{\delta \langle A(x_j) \rangle_h}{\delta h_k} \right|_{h=0} = \sum_{x_{k-1}, \dots, x_j} A(x_j) \, T(x_j|x_{j-1}) \cdots T(x_{k+1}|x_k) \left. \frac{\delta T_h(x_k|x_{k-1})}{\delta h} \right|_{h=0} P_{eq}(x_{k-1}) . \]



Let us assume that the variable $x$ is even with respect to time-reversal. In this case, and
if detailed balance holds, $T_h(x_k|x_{k-1}) P^h_{eq}(x_{k-1}) = T_h(x_{k-1}|x_k) P^h_{eq}(x_k)$, and

\[
\frac{\delta T_h(x_k|x_{k-1})}{\delta h} = \frac{\delta}{\delta h} \left( \frac{P^h_{eq}(x_k)}{P^h_{eq}(x_{k-1})} \right) T_h(x_{k-1}|x_k) + \frac{P^h_{eq}(x_k)}{P^h_{eq}(x_{k-1})} \, \frac{\delta T_h(x_{k-1}|x_k)}{\delta h}
\tag{6.26}
\]

that has to be evaluated at $h = 0$. The first term is easily calculated by replacing
$P^h_{eq}(x_k) = \exp\left( -\beta V(x_k) + \beta h B(x_k) \right) / Z_h$ and yields

\[ \beta \left( B(x_k) - B(x_{k-1}) \right) \frac{P_{eq}(x_k)}{P_{eq}(x_{k-1})} \, T(x_{k-1}|x_k) . \tag{6.27} \]

The second term in eqn (6.26) vanishes identically; indeed, after simplification, $x_{k-1}$ is
seen to appear only in the derivative of the transition probability. Using the $h$-independent
normalisation condition, $\sum_{x_{k-1}} T_h(x_{k-1}|x_k) = 1$, one finally finds that this term is zero.



Thus

\[ \left. \frac{\delta \langle A(x_j) \rangle_h}{\delta h_k} \right|_{h=0} = \beta \left\langle A(x_j) \left( B(x_k) - B(x_{k-1}) \right) \right\rangle , \qquad j \geq k . \tag{6.28} \]

This relation expresses the FDT in discrete time. Interestingly enough, we did not need
to specify the $h$ dependence of the transition probabilities $T$, provided this dependence is
compatible with detailed balance.
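The discrete-time relation (6.28) can be tested exactly on a small Markov chain. The sketch below assumes a hypothetical three-state system with energies `V`, observables `A` and `B`, and a Glauber-like transition rule. The rule is one possible choice that obeys detailed balance with respect to $\exp[-\beta(V - hB)]$; the text deliberately leaves the $h$ dependence of $T$ unspecified. The response is computed by a centred finite difference in a field applied during a single time-step.

```python
import math

BETA = 1.3
V = [0.0, 0.4, 1.1]     # hypothetical energies of three states
B = [1.0, -0.5, 2.0]    # observable coupled to the field, H -> H - h B
A = [0.3, 1.7, -0.8]    # observable whose mean is measured
N = len(V)

def transition(h):
    """T[y][x] = T_h(y|x); Glauber-like rule obeying detailed balance."""
    energy = [V[x] - h * B[x] for x in range(N)]
    T = [[0.0] * N for _ in range(N)]
    for x in range(N):
        for y in range(N):
            if y != x:
                gap = energy[y] - energy[x]
                T[y][x] = 1.0 / ((N - 1) * (1.0 + math.exp(BETA * gap)))
        T[x][x] = 1.0 - sum(T[y][x] for y in range(N) if y != x)
    return T

def propagate(T, p):
    """Apply one time-step of the chain to the distribution p."""
    return [sum(T[y][x] * p[x] for x in range(N)) for y in range(N)]

Z = sum(math.exp(-BETA * v) for v in V)
P_EQ = [math.exp(-BETA * v) / Z for v in V]
T0 = transition(0.0)

def mean_a(h, lag):
    """<A(x_j)>_h with the field on only during t_{k-1} -> t_k, lag = j - k."""
    p = propagate(transition(h), P_EQ)
    for _ in range(lag):
        p = propagate(T0, p)
    return sum(A[x] * p[x] for x in range(N))

def response(lag, eps=1e-5):
    """Left-hand side of (6.28) from a centred finite difference in h."""
    return (mean_a(eps, lag) - mean_a(-eps, lag)) / (2.0 * eps)

def fdt_rhs(lag):
    """Right-hand side of (6.28): beta <A(x_j) (B(x_k) - B(x_{k-1}))>."""
    total = 0.0
    for x_k in range(N):
        p = [1.0 if y == x_k else 0.0 for y in range(N)]
        for _ in range(lag):
            p = propagate(T0, p)
        a_given_xk = sum(A[y] * p[y] for y in range(N))
        for x_prev in range(N):
            total += (a_given_xk * (B[x_k] - B[x_prev])
                      * T0[x_k][x_prev] * P_EQ[x_prev])
    return BETA * total

for lag in range(4):
    print(lag, response(lag), fdt_rhs(lag))   # the two columns should agree
```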

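Note that since in equilibrium the averages of one-time quantities are constant one can
replace $\langle A(x_j) B(x_k) \rangle$ by the connected correlation
$\langle A(x_j) B(x_k) \rangle - \langle A(x_j) \rangle \langle B(x_k) \rangle$ and
the FDT also reads

\[ \left. \frac{\delta \langle A(x_j) \rangle_h}{\delta h_k} \right|_{h=0} = \beta \left\langle A(x_j) \left( B(x_k) - \langle B(x_k) \rangle - B(x_{k-1}) + \langle B(x_{k-1}) \rangle \right) \right\rangle , \qquad j \geq k . \]

In its integrated form

\[ \chi_{jl} \equiv \sum_{k=l+1}^{j} \left. \frac{\delta \langle A(x_j) \rangle_h}{\delta h_k} \right|_{h=0} = \beta \left( \langle A(x_j) B(x_j) \rangle - \langle A(x_j) B(x_l) \rangle \right) . \tag{6.29} \]

The RHS can also be written in terms of connected correlations.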

The LHS in eqn (6.28) is just the linear response, $R_{AB}(t_j, t_k)$, while the RHS is a
discretized time-derivative of the correlation, $C_{AB}(t_j, t_k)$, with respect to the earlier time
$t_k$. We have already assumed that the system is equilibrated at time $t_{k-1}$. Thus, all
correlations and responses are stationary and we have:

\[ R^s_{AB}(t_j - t_k) = \beta \left( C^s_{AB}(t_j - t_k) - C^s_{AB}(t_j - t_{k-1}) \right) , \qquad j \geq k , \]

that in the continuous limit $t_k - t_{k-1} \to 0$ becomes

\[ R^s_{AB}(t - t') = \beta \, \partial_{t'} C^s_{AB}(t - t') , \qquad t \geq t' . \tag{6.30} \]

One can also express the FDT in an equivalent form in which the correlation is replaced
by the displacement:

\[ R^s_{AB}(t - t') = - \frac{\beta}{2} \, \partial_{t'} \Delta^s_{AB}(t - t') . \]

In a problem that undergoes normal diffusion this expression implies the Einstein
relation (6.13).

If one deals with a more complicated Markov chain with several degrees of freedom, and
with both even and odd components of the transition rates with respect to time-reversal,
the proof carries through in the same way iff the perturbation applied does not break
time-reversal. In other words, perturbations that modify the Hamiltonian as

\[ H \to H - h B(\{x_i\}) \qquad \mbox{with} \qquad B(\{x_i\}(-t)) = B(\{x_i\}(t)) \]

admit an FDT of the form given above. For instance, if we deal with a particle characterised
by its position and velocity, the function $B$ can have any dependence on the position but it
must be an even function of the velocity. An example would be to perturb the kinetic
energy of the particle.

A fluctuation-dissipation relation for the velocities takes the form:

\[ T R_{vx}(t,t') = T \left. \frac{\delta \langle v(t) \rangle}{\delta h(t')} \right|_{h=0} = \langle v(t) v(t') \rangle = C_{vv}(t,t') , \tag{6.31} \]

when the field $h$ perturbs the system by being coupled to the coordinate at time $t'$. This
property is a version of a Kubo formula and it can be useful in problems in which it is
easy to measure a current. Being very sloppy, one can "derive" it from (??) by taking a
derivative with respect to $t$ on both sides and exchanging the average over the noise and
the time-derivatives (a highly non-trivial operation).

FDT in the frequency domain. 

Experimental devices often measure the correlation and response function in the
frequency domain. The Fourier transform of the FDT relation (6.30) reads:

\[ \chi_{AB}(\omega) = \beta \left( C^s_{AB}(0) + i \omega \int_0^\infty d\tau \, e^{i\omega\tau} C^s_{AB}(\tau) \right) . \tag{6.32} \]

The imaginary part of this expression leads to the FDT in the frequency domain:

\[ \frac{T \chi''_{AB}(\omega)}{\omega} = \mbox{Re} \int_0^\infty d\tau \, e^{i\omega\tau} C^s_{AB}(\tau) . \]

There is no extra information in the real part, since it is related to $\chi''_{AB}(\omega)$ by the
Kramers-Kronig relations (6.32) or (6.33).

When the FDT is written in the above form we see the justification of its name. As
discussed in Sect. 6.2, $\chi''(\omega)$ quantifies the dissipation and the RHS is related to the power
spectrum of the spontaneous fluctuations in the system.
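The frequency-domain statement can be checked on a simple example. The sketch below assumes an exponentially decaying stationary correlation $C(\tau) = C_0 e^{-\tau/\tau_0}$, which is a hypothetical choice, as are the parameter values. It computes the susceptibility in two ways: as the one-sided Fourier transform of the response given by (6.30), and from the right-hand side of (6.32). The integrals use a plain trapezoidal rule.

```python
import cmath
import math

BETA, C0, TAU0 = 2.0, 1.5, 1.0   # hypothetical parameters

def corr(tau):
    """Assumed stationary correlation C(tau) = C0 exp(-tau / tau0)."""
    return C0 * math.exp(-tau / TAU0)

def resp(tau):
    """R(tau) = -beta dC/dtau for tau > 0, i.e. eq. (6.30) with tau = t - t'."""
    return BETA * C0 / TAU0 * math.exp(-tau / TAU0)

def half_fourier(f, omega, t_max=60.0, n=60000):
    """Trapezoidal estimate of int_0^t_max dtau exp(i omega tau) f(tau)."""
    dt = t_max / n
    total = 0.5 * (f(0.0) + cmath.exp(1j * omega * t_max) * f(t_max))
    for i in range(1, n):
        tau = i * dt
        total += cmath.exp(1j * omega * tau) * f(tau)
    return total * dt

omega = 0.8
chi_from_response = half_fourier(resp, omega)
chi_from_correlation = BETA * (corr(0.0) + 1j * omega * half_fourier(corr, omega))
chi_exact = BETA * C0 / (1.0 - 1j * omega * TAU0)   # closed form for this C
print(chi_from_response, chi_from_correlation, chi_exact)
```

For this choice of $C$ the imaginary part is a Lorentzian-like absorption curve, $\chi''(\omega) = \beta C_0 \, \omega \tau_0 / (1 + \omega^2 \tau_0^2)$.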

6.8 Monte Carlo and Glauber dynamics

7 Interface and surface growth

An interface is a frontier separating two regions of space with two phases. For instance, it
could be the border between water and oil in a liquid mixture, the border between regions
with positive and negative magnetization in a magnet, etc. As a physicist one would like
to characterize the static and dynamic properties of such a surface.

7.1 Domain growth 

Take a magnetic system, such as the ubiquitous Ising model, and quench it into the
low temperature phase starting from a random initial condition. In the course of time
neighbouring spins will realize that their preferred configuration is an ordered one and
domains of the two ordered phases will form and grow, since in some regions it will be one
of them that will prevail while in other regions it will be the other one. At any finite time
the configuration will be such that both types of domains exist.

The typical radius of the magnetic domains can be determined with a number of
arguments in the case of pure ferromagnetic models. One finds

\[ R(t) \sim t^{1/2} , \tag{7.1} \]

and the domains and their surface are compact (i.e. they have dimension $d$ and $d-1$,
respectively). Another question one may be interested in is characterizing the
distribution of the sizes of these domains and its evolution. This is known in $d = 1$ but
much less can be said about the higher dimensional problem.

A different type of dynamics occurs in the case of phase separation (the water and oil
mixture). In this case, the material is locally conserved, i.e. water does not transform
into oil but they just separate. Determining the growth and geometrical properties of the
domains is already much harder in this case. After some discussion it was established, as
late as in the early 90s, that for systems with conserved order parameter, such as the example
at hand, the growth is given by

\[ R(t) \sim t^{1/3} . \tag{7.2} \]

The situation becomes much less clear if one has quenched random interactions. Al- 
ready in the case of the random ferromagnet, for example, there is no consensus about 
the growth law. In the case of spin-glasses, if the mean-field picture with a large number 
of equilibrium states is realized in finite dimensional models, the dynamics would be one 
in which all these states grow in competition. If, instead, the droplet model applies, there 
would be two types of domains growing. Some refined arguments that we shall not discuss 
here indicate that the dimension of the bulk of these domains should be compact but their 
surface should be a fractal. 

Note that if one starts such a dynamics in a very large ($L \gg a$) system with no biasing
magnetic field the system will never magnetize in finite times with respect to $L$. More
explicitly, if the growth law is the power law (7.1) one needs times of the order of $L^2$ to
grow a domain of the size of the system. For any shorter time, domains of the two types
exist and the system is out of equilibrium. This system constitutes a first example of
a problem with slow dynamics. Whether all systems with slow dynamics, in particular
structural and spin glasses, undergo some kind of simple though slow domain growth is
an open question.

7.2 Phase space dynamics

In the study of disordered systems or, better said, in the applications of disordered
systems and ideas in other areas such as biology, economics, etc., it is common to discuss the
dynamics in phase space. Adopting a mean-field-like viewpoint in which disordered
systems are characterized by many metastable states of all types, one pictures the dynamics
of the system as the wandering of a representative point in a phase space characterized
by a rough (free)-energy landscape. This viewpoint can be very useful if correctly used
but it can also be misleading.

Indeed, phase space is a highly dimensional space (with $2^N$ configurations in a system of $N$
spins) and one can only grasp three dimensional geometries. To show this problem, let us come
back to a magnetic system that is quenched to its low temperature phase. The initial, high
temperature configuration is not magnetized. It corresponds to the unstable maximum of the
free-energy density $f(m)$. Naively, one would imagine that in time the point that sits on top
of this barrier very quickly falls to one of the two minima corresponding to $\pm m$, the
equilibrium values. Instead, we know that this happens on a rather slow time scale and
for all times $t \ll L^2$ the system is not magnetized and just sits on top of the maximum.

Moreover, in more complicated systems, such as mean-field spin-glass models, the relevant
free-energy density is not a function of just one parameter but of $N$ of them, the $N$ local
magnetizations. The dynamics is not just given by jumps from one minimum to another
and saddle-points take an unexpected importance that might go beyond these solvable
but unrealistic models.



7.3 Surface growth 

Another interesting problem consists in characterizing the growth of a surface on a sub- 
strate due to material deposition combined with a transverse diffusion of material that 
reaches the surface. 

The simplest model for the growth and rearrangement of a surface is due to Edwards
and Wilkinson, who showed that the continuum limit of the process of sedimentation of
granular particles under gravity on an initially flat substrate and their further diffusion on
the surface leads to

\[ \frac{\partial h(x,t)}{\partial t} = \nu \nabla^2 h(x,t) + \xi(x,t) , \tag{7.3} \]

where $x$ is a $d$-dimensional spatial vector denoting position on the substrate, $t$ is time
and $h$ is a scalar function taking real values and measuring the height with respect to
its average value. The last term is a thermal noise, typically chosen to have a Gaussian
probability distribution with zero average and correlations

\[ \langle \xi(x,t) \xi(x',t') \rangle = 2T \delta(x - x') \delta(t - t') . \tag{7.4} \]

Equation (7.3) is a stochastic field equation. The first term on the RHS of eqn (7.3)
penalizes rapid variations of the surface in space and $\nu$ is then a measure of the surface
tension. The noise term describes the randomness in the deposition process. Note that the
Edwards-Wilkinson equation can in fact be seen as describing the equilibrium fluctuations
of an interface (for example liquid-vapor), with the noise term describing the thermal
fluctuations. The stochastic equation (7.3) is linear and can be easily solved using a
Fourier transform of the space coordinate $x$.

When the interface is driven out of equilibrium by an external force (for example a
non-zero average flux of particles impinging on the surface, or a difference in chemical
potential between the two sides of the interface), then a new, non-linear term of the form
$[\nabla h(x,t)]^2$ should be added. This leads to the well-studied Kardar-Parisi-Zhang equation
with many applications in various fields.

Usually, the kinetics of a surface is characterised by the width of the surface that is
defined as the mean-squared displacement of the total height and depends on time and
the linear size of the substrate, $L$. Asymptotically, this quantity has a scaling limit given
by:

\[ W^2(L,t) = \langle \left( h(x,t) - \langle h(x,t) \rangle \right)^2 \rangle = L^{\alpha} \mathcal{F}(t L^{-z}) , \tag{7.5} \]

where the roughness exponent $\alpha$, the dynamical exponent $z$ and the scaling function $\mathcal{F}$
are defined by the above equation in the large $L$ limit. The average denotes either an
average over thermal histories or an average over space, $\langle \cdot \rangle = A_L^{-1} \sum_x \cdot$, with the area of
the substrate of linear dimension $L$ indicated by $A_L$. Recently, there has been growing
interest in characterising the complete dynamic probability distribution $P_L(W^2,t)$ since it
has been suggested that it might be used to define universality classes for non-equilibrium
steady states via its scaling function.

Note the similarity between this problem and the one of the fluctuations of the global 
magnetization in the critical 2d XY model that we discussed in Sect. 4.7.5. Indeed, the 
two problems are intimately connected since the dynamics of the 2d XY model in spin- 
wave approximation is given by eq. (7.3). 
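Because eq. (7.3) is linear, the width (7.5) can be computed mode by mode. The sketch below works under assumptions not made in the text: a one-dimensional lattice version of the Edwards-Wilkinson equation with $L$ sites, unit spacing, periodic boundaries, a flat initial condition and the noise (7.4). Each Fourier mode with lattice-Laplacian eigenvalue $q_n = 2 - 2\cos(2\pi n/L)$ then contributes $T (1 - e^{-2\nu q_n t})/(\nu q_n)$ to the sum. The function names are hypothetical.

```python
import math

def ew_width_squared(L, t, nu=1.0, T=1.0):
    """W^2(L, t) for the 1d lattice Edwards-Wilkinson equation, flat start."""
    total = 0.0
    for n in range(1, L):                     # n = 0 is the mean height
        q = 2.0 - 2.0 * math.cos(2.0 * math.pi * n / L)
        total += T * (1.0 - math.exp(-2.0 * nu * q * t)) / (nu * q)
    return total / L

def ew_saturated_width_squared(L, nu=1.0, T=1.0):
    """Closed form of the t -> infinity limit, using sum_n 1/q_n = (L^2-1)/12."""
    return T * (L * L - 1) / (12.0 * nu * L)

print(ew_width_squared(64, 1e6), ew_saturated_width_squared(64))
print(ew_width_squared(4096, 200.0) / ew_width_squared(4096, 50.0))  # close to 2
```

With the convention of eq. (7.5), the saturated value grows linearly in $L$ and the early-time growth goes as $t^{1/2}$, which corresponds to $\alpha = 1$ and $z = 2$ for this one-dimensional example.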

The above definition leads to what can be called a "global" characterisation of the 
roughness of the interface, averaged over the whole surface. Another, more local, charac- 
terisation of the roughness of the surface, is defined as 

\[ \Delta(x,x',t,t') \equiv \langle \left( h(x,t) - h(x',t') \right)^2 \rangle = |x - x'|^{\alpha} \, F\!\left( |t - t'| \, |x - x'|^{-z} \right) , \tag{7.6} \]

where we are now looking at the limit $\Lambda^{-1} \ll |x - x'| \ll L$. Often, the exponents and
the scaling function defined globally over the whole surface (as in eq. (7.5)) coincide with
their local counterpart, but this is not always the case.

A Probability and statistics

We are used to reading the word "statistics" in many contexts of science but also everyday 
life. The word statistics has its origin in the Latin "status" as well as the word "state", 
suggesting that governments used statistical concepts since long ago. In a few words, 
statistics is the theory that allows one to make sense out of a list of numbers; it is the 
branch of mathematics that deals with the analysis of data. One identifies two sub- 
branches: 

1. Descriptive statistics. The goal is to obtain useful information from a series of 
raw data that is typically too large to deal with directly. For instance, present 
experiments in molecular biology present the "difficulty" of yielding too much raw 
data that need processing before becoming useful. Descriptive statistics is a set of 
tools, or mathematical manipulations of the raw data, that convert them into a few 
numbers and plots that are easy to understand. 

2. Inferential statistics. The goal is to obtain useful information about a very large 
population being able to test only a sample, that is to say, a small portion of the 
total population. The typical example of the application of inferential statistics are 
election polls. Clearly, one is not able to ask every citizen for which candidate he or 
she is going to vote. However, one can ask a sample of the population and infer from 
the result which is going to be the global one. One of the main difficulties in this 
case is related to the choice of the sample. Clearly, if the sample is taken exclusively 
from a posh neighbourhood the result will be different from the one obtained from a 
poor one. Similarly, if only aged people are consulted, the result might be different 
from the one obtained using a sample of young people. 

Probability Theory is at the basis of Statistics. It allows one to identify generic behaviour
of random events. The mathematical theory of Statistics developed mostly in England,
during the beginning of the XXth century. It was also the time when it became clear,
with the development of Statistical Mechanics and Quantum Mechanics, that a purely
deterministic description of physics was not feasible. [Interestingly enough, many
developments in the Theory of Statistics were the consequence of research in Agriculture (the
analysis of the effect of fertilizers!) by Sir R. Fisher.]

Let us here recall the definition and main properties of probabilities. If one tosses a
coin it is practically impossible to predict if it will fall heads or tails (try to solve Newton's
equations for the coin and air...). However, repeating the experiment many times one can
predict general patterns of the head-tail sequence. This is a general feature of random
events: one is unable to predict the outcome of a single experiment but one can predict
some general features of a sequence of them. The same feature applies to determining the
position and velocity of a particle in a gas, and to so many other physical systems involving
many agents in interaction.

1.1 Frequency and probability 

If we toss an unbiased coin we expect all sequences of heads (H) and tails (T) to be equally
likely. Why? This is a very important assumption! How can we justify it?

Imagine that we toss the same coin a very large number of times, $n \to \infty$, we count
the number of Hs and Ts that we obtain, and we construct the probability or large $n$ limit
frequency:

\[ P(H) = \lim_{n\to\infty} f_n(\#H) = \lim_{n\to\infty} \frac{\#Hs}{n} , \tag{1.1} \]

\[ P(T) = \lim_{n\to\infty} f_n(\#T) = \lim_{n\to\infty} \frac{n - \#Hs}{n} = 1 - P(H) . \tag{1.2} \]

This definition quantifies the subjective idea 'what do we expect to get after tossing a
coin?' If the coin is not biased and $n$ is sufficiently large, we should get one half of Hs and
one half of Ts. Thus, for $n \to \infty$:

\[ p \equiv P(H) = \frac{1}{2} = 1 - P(T) , \qquad q \equiv P(T) = \frac{1}{2} . \tag{1.3} \]

If, instead, the coin is biased, H (or T) will appear more frequently. Then $p > \frac{1}{2}$ and
$q < \frac{1}{2}$ or vice versa.
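The frequency definition can be illustrated with a pseudo-random coin. The sketch below uses Python's standard `random` module with a fixed seed; the function name `head_frequency`, the seed and the sample sizes are arbitrary illustrative choices, and a finite $n$ only approximates the limit in (1.1).

```python
import random

def head_frequency(n, p=0.5, seed=0):
    """Frequency of heads in n tosses of a coin with P(H) = p."""
    rng = random.Random(seed)
    heads = sum(1 for _ in range(n) if rng.random() < p)
    return heads / n

for n in (100, 10_000, 200_000):
    print(n, head_frequency(n), head_frequency(n, p=0.7))
```

The fluctuations of the frequency around $p$ shrink roughly as $1/\sqrt{n}$, which is why the estimate settles down as the number of tosses grows.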



1.2 "Ideal experimental" definition of probability

Based on the coin toss example it is natural to define in general the probability of an event
as the value taken by the frequency in the limit $n \to \infty$:

\[ P(\mathcal{E}) = \lim_{n\to\infty} f_n(\mathcal{E}) \tag{1.4} \]

where $\mathcal{E}$ denotes the event we are interested in, e.g. getting 3 Hs after tossing $n \to \infty$
coins.

Let us mention that one is used to hearing weather reports in which people talk about
the "probability of having rain tomorrow". In this case, the definition of probability as
a limiting procedure is much less clear and becomes much more subjective. We stick to
objective cases here where the above definition can be safely applied.

But it is actually not necessary to go to such an extreme example to realize that sometimes
the definition above may be ambiguous. This problem has been discussed in the past.
Kolmogorov gave a rigorous definition of probability that we shall not discuss here. We
shall simply say that it is based on using the list of properties that we discuss below as a
definition of probability.



1.3 Properties 

We now list a series of properties of probabilities. Some of them are obvious and follow
simply from the frequency-based definition given above. Others are not and constitute
new definitions.

Semi-positive definite. It is clear that the probability of an event is a quantity that can
take only positive or zero values:

$$P(\mathcal{E}) \geq 0 . \qquad (1.5)$$

Bound. It is also clear from its definition that the probability of an event is bounded by
one:

$$P(\mathcal{E}) \leq 1 . \qquad (1.6)$$

Normalization. Note that the sum of the number of occurrences of each (mutually exclusive) event is equal to
the total number of trials. Thus, the sum of the frequencies over the events is
normalised to one, and the sum of probabilities is also normalised to one:

$$\sum_{\mathcal{E}} P(\mathcal{E}) = \sum_{\mathcal{E}} \lim_{n\to\infty} f_n(\mathcal{E}) = 1 \qquad (1.7)$$

(assuming that one can exchange sum and limit).

Probability of the complementary event. If the probability of occurrence of an event $\mathcal{E}$ is
$P(\mathcal{E})$, the probability of non-occurrence of the same event (its complementary event, $\bar{\mathcal{E}}$)
is

$$P(\bar{\mathcal{E}}) = 1 - P(\mathcal{E}) . \qquad (1.8)$$

This is just a consequence of the normalization of the probabilities.

For example, when we introduced the coin toss problem we said that when
tossing a normal coin one finds H one half of the times and T the other half of the times.
In other words, we assumed that the probability of getting H in a toss is 1/2 and the
probability of finding the complementary event T is $1 - 1/2 = 1/2$ too.

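Addition principle. For any pair of events $\mathcal{E}_1$ and $\mathcal{E}_2$,

$$P(\mathcal{E}_1 \vee \mathcal{E}_2) = P(\mathcal{E}_1) + P(\mathcal{E}_2) - P(\mathcal{E}_1 \wedge \mathcal{E}_2) . \qquad (1.9)$$

The symbol $\vee$ represents the logical (not exclusive) "or" while the symbol $\wedge$ represents
the logical "and". The meaning of the logical or is that the event $\mathcal{E}_1 \vee \mathcal{E}_2$ is true whenever
$\mathcal{E}_1$ is true, $\mathcal{E}_2$ is true or both $\mathcal{E}_1$ and $\mathcal{E}_2$ are true. The meaning of the logical and is that
the event $\mathcal{E}_1 \wedge \mathcal{E}_2$ is true only if both $\mathcal{E}_1$ and $\mathcal{E}_2$ are true.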

Let us illustrate this property with an example. Imagine that one is playing with two
dice, a red and a blue one (to make them distinguishable). What is the probability of
getting 1 with the red one or 2 with the blue one?

This problem is represented mathematically as follows. The event $\mathcal{E}_1$ is getting 1 with
the red die. The event $\mathcal{E}_2$ is getting 2 with the blue die. There are $6 \times 6 = 36$ possible outcomes
of throwing the two dice, i.e. getting the pairs (1,1); (1,2); ...; (2,1); (2,2); ...; (6,6)
where the first element in each pair is the result of the red die and the second one is the result
of the blue die. The event $\mathcal{E}_1 \vee \mathcal{E}_2$ is true in the cases (1,1); (1,2); (1,3); (1,4); (1,5); (1,6);
(2,2); (3,2); (4,2); (5,2); (6,2), so there are 11 true realizations and, since the dice are not
biased, after $n \to \infty$ repetitions of the experiment "throwing the two dice" we expect to
find a fraction 11/36 of successful tries. Thus, the probability of this event is $P(\mathcal{E}_1 \vee \mathcal{E}_2) = 11/36$.

Now, the number of positive outcomes for the event $\mathcal{E}_1$ is just 6. The same applies
to the event $\mathcal{E}_2$. The sum of these two numbers is 12 and it does not coincide with the
result above. The reason is that the outcome (1,2), which satisfies both conditions, is counted
twice: once in the number of positive outcomes for $\mathcal{E}_1$ and once in the number of positive
outcomes for $\mathcal{E}_2$. One needs to correct this double counting and this is why there is the
last term in (1.9). Taking this into account, one finds that the right-hand side of this
equation predicts $P(\mathcal{E}_1 \vee \mathcal{E}_2) = (6 + 6 - 1)/36 = 11/36$, which is the correct result.
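
The counting argument for the two dice can be checked by brute-force enumeration. The sketch below is only an illustration; the helper names `prob`, `e1` and `e2` are hypothetical and simply encode the two events of the example.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes (red, blue) of throwing the two dice.
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a predicate on (red, blue)."""
    favourable = sum(1 for o in outcomes if event(o))
    return Fraction(favourable, len(outcomes))

def e1(o):
    return o[0] == 1  # red die shows 1

def e2(o):
    return o[1] == 2  # blue die shows 2

p_or = prob(lambda o: e1(o) or e2(o))
p_and = prob(lambda o: e1(o) and e2(o))

print(p_or)                         # 11/36
print(prob(e1) + prob(e2) - p_and)  # 11/36, the right-hand side of (1.9)
```

Exact fractions are used instead of floating-point numbers so that the two sides of the addition principle can be compared with `==`.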

Particular case: mutually exclusive events. When the events $\mathcal{E}_1$ and $\mathcal{E}_2$ are mutually
exclusive, $\mathcal{E}_1 \wedge \mathcal{E}_2 = \emptyset$ and the formula (1.9) simplifies to

$$P(\mathcal{E}_1 \vee \mathcal{E}_2) = P(\mathcal{E}_1) + P(\mathcal{E}_2) . \qquad (1.10)$$

Working again with dice, mutually exclusive events are 'getting 1' and 'getting 2' with one die
in one throw. There are plenty of other examples of this sort.

Joint probability. One calls in this way the probability of simultaneous occurrence of a
number of events,

$$P(\mathcal{E}_1, \mathcal{E}_2, \dots, \mathcal{E}_n) . \qquad (1.11)$$

The comma means here the same as the symbol $\wedge$ in (1.9).

Independence principle. If two events can occur independently of the realisation of the
other, the probability of the simultaneous occurrence of them is simply the product of the
individual probabilities:

$$P(\mathcal{E}_1, \mathcal{E}_2) = P(\mathcal{E}_1) P(\mathcal{E}_2) . \qquad (1.12)$$

The generalisation to $n$ independent events is straightforward:

$$P(\mathcal{E}_1, \mathcal{E}_2, \dots, \mathcal{E}_n) = P(\mathcal{E}_1) P(\mathcal{E}_2) \cdots P(\mathcal{E}_n) . \qquad (1.13)$$

As an example one can imagine playing with dice and cards and asking about events
that are associated to the dice and to the cards, independently.

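Conditional probability. The probability of the occurrence of an event $\mathcal{E}_1$ conditioned
on the occurrence of another event $\mathcal{E}_2$ is

$$P(\mathcal{E}_1 | \mathcal{E}_2) = \frac{P(\mathcal{E}_1, \mathcal{E}_2)}{P(\mathcal{E}_2)} . \qquad (1.14)$$

Clearly, if $\mathcal{E}_1$ and $\mathcal{E}_2$ are independent events, $P(\mathcal{E}_1 | \mathcal{E}_2) = P(\mathcal{E}_1)$.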
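
As an illustration of (1.14), one can again enumerate the outcomes of a red and a blue die. This is a sketch under the assumption of unbiased dice; the event `sum_is_3` is an extra, hypothetical example chosen because it is not independent of the red die.

```python
from fractions import Fraction
from itertools import product

outcomes = list(product(range(1, 7), repeat=2))  # (red, blue) pairs

def prob(event):
    """Probability of an event, given as a predicate on (red, blue)."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def conditional(event_a, event_b):
    """P(A | B) = P(A, B) / P(B), as in (1.14)."""
    return prob(lambda o: event_a(o) and event_b(o)) / prob(event_b)

def red_is_1(o):
    return o[0] == 1

def blue_is_2(o):
    return o[1] == 2

def sum_is_3(o):
    return o[0] + o[1] == 3

print(conditional(red_is_1, blue_is_2))  # 1/6 = P(red_is_1): independent events
print(conditional(red_is_1, sum_is_3))   # 1/2: knowing the sum changes the probability
```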

In the TDs we shall see plenty of examples that illustrate these properties. 



1.4 Discrete and continuous random variables 

A careful experimentalist performs a measurement many times in identical conditions, 
then calculates the frequency of each result and from them, in the limit of a very large 
number of measurements (that in real life are never infinite!) estimates the probability of 
each result. We call random variable the result of the experiment. 

For instance, in the coin toss problem the random variable, let us call it $x$, takes
only two possible values, H and T. We can associate these two exclusive results with the
numbers 0 and 1 and then call the random variable $x$ bimodal. Bimodal random variables
are discrete since they can only take values on a discrete set (0, 1 in this case).

Other random variables can take values on continuous sets, such as the real numbers, and
are hence called continuous random variables. Imagine that we look at the absolute value
of the velocity $v$ of a particle moving within a gas. In principle, this value can be any real
number (most probably bounded...) and hence $v$ is a continuous random variable.

When dealing with continuous random variables we have to be more precise about what
we mean by the probability of an event. In this case, the quantity that is well defined is the
probability for the random variable to take values within a given interval. In the case of the velocity
of a particle in the gas, we ask what is the probability that the particle has an absolute
velocity between $v_1$ and $v_1 + \Delta v$. In the limit in which $\Delta v$ is infinitesimally
small ($\Delta v \to dv$) this allows us to define the probability density $p(v)$: the probability for the
random variable to take values within the infinitesimal interval of length $dv$ starting at
$v$ is

$$p(v) \, dv . \qquad (1.15)$$

We use lower case to denote probability densities and upper case to indicate probabilities.
Next we define quantities that characterise the behaviour of random variables.

Probability distribution. The set of probabilities associated to a random variable is called
a probability distribution. Probability distributions may be represented by tables but
it is far more convenient to represent them with formulae or diagrams that generalize the
frequency plot in Fig. ??-right. These are just plots with $P(x)$ on the y-axis and $x$ on the
x-axis.

Cumulative probability. With the probability distribution of a discrete random variable
one constructs a cumulative probability that is simply the probability for the random
variable to take a value that is larger than some chosen one:

$$F(x_1) = \sum_{x > x_1} P(x) . \qquad (1.16)$$

Similarly, for a continuous random variable,

$$F(x_1) = \int_{x_1}^{\infty} dx \, p(x) . \qquad (1.17)$$



Expected value, mean or average. The expected values of a discrete and a continuous random
variable are

$$E(x) = \sum_x x P(x) , \qquad E(x) = \int dx \, p(x) \, x , \qquad (1.18)$$

respectively.

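Median. It is the "middle" value:

$$\int_{-\infty}^{x_{\rm median}} dx \, p(x) = \int_{x_{\rm median}}^{\infty} dx \, p(x) . \qquad (1.19)$$

The word median originates in the Latin medius.

Mode. It is the most probable value, i.e. a maximum of the probability density:

$$x_{\rm mode} : \quad \left. \frac{dp(x)}{dx} \right|_{x_{\rm mode}} = 0 , \qquad \left. \frac{d^2 p(x)}{dx^2} \right|_{x_{\rm mode}} < 0 . \qquad (1.20)$$

The mode is not necessarily unique.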

Variance. The variance of a discrete and a continuous random variable are defined as

$$\sigma_x^2 \equiv \sum_x (x - E(x))^2 P(x) , \qquad \sigma_x^2 \equiv \int dx \, p(x) (x - E(x))^2 , \qquad (1.21)$$

respectively. It measures the spreading of the pdf, i.e. the variance is small if all data are
concentrated around the mean and it is large otherwise.
The variance is well-suited to distinguish rare events.

Standard deviation. It is just given by

$$\sigma_x = \sqrt{\sigma_x^2} . \qquad (1.22)$$

Since it is measured in the same units as the average, it is directly comparable to it.

Relative variability. It is a (dimensionless) comparison between the standard deviation and
the average:

$$rv_x = \frac{\sigma_x}{E(x)} . \qquad (1.23)$$

It is relevant to compare the spread to the average of the data.

Moments. The $k$-th moment of a random variable is defined as

$$\mu_x^{(k)} \equiv \sum_x x^k P(x) , \qquad \mu_x^{(k)} \equiv \int dx \, x^k p(x) . \qquad (1.24)$$

Knowing all moments one can, in general, reconstruct the functional form of $P$ or $p$.
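
The definitions (1.18) and (1.21)-(1.24) are easy to evaluate for a small discrete distribution. The sketch below uses a fair six-sided die, which is an illustrative choice and not an example taken from the course; the function name `moment` is likewise hypothetical.

```python
from fractions import Fraction
import math

# Fair six-sided die: P(x) = 1/6 for x = 1, ..., 6 (illustrative choice).
P = {x: Fraction(1, 6) for x in range(1, 7)}

def moment(P, k):
    """k-th moment, sum_x x^k P(x), as in (1.24)."""
    return sum(x**k * p for x, p in P.items())

mean = moment(P, 1)                                      # expected value, (1.18)
variance = sum((x - mean)**2 * p for x, p in P.items())  # variance, (1.21)
std = math.sqrt(variance)                                # standard deviation, (1.22)
rv = std / float(mean)                                   # relative variability, (1.23)

print(mean, variance)                      # 7/2 35/12
print(moment(P, 2) - mean**2 == variance)  # True
```

The last line checks the familiar identity that the variance equals the second moment minus the square of the first one.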

Correlation. The correlation quantifies the similarity between two (or more) random variables.
There are many possible definitions of correlation and each may be better adapted
to some problem.

Let us take two random variables $x$ and $y$ described by the joint probability distribution
$p(x,y)$. For example, $x$ and $y$ may represent two different observables of a physical
system.

The correlation between the two sets $X$ and $Y$ is then defined as

$$C_{xy} = \frac{1}{\sigma_x \sigma_y} \int dx \int dy \, p(x,y) (x - \mu_x)(y - \mu_y) \qquad (1.25)$$

with the averages $\mu_x$ and $\mu_y$ and the standard deviations $\sigma_x$ and $\sigma_y$ defined above. $C_{xy}$
takes values between $-1$ (complete anticorrelation) and $1$ (complete correlation); $C_{xy} = 0$ corresponds to uncorrelated variables.
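
For a finite data set the integrals in (1.25) become sums over the measured pairs. The following sketch estimates the correlation from paired samples; the function name `correlation` and the tiny data sets are hypothetical and only serve to show the three limiting behaviours.

```python
import math

def correlation(xs, ys):
    """Sample estimate of C_xy in (1.25) from paired data xs, ys."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = math.sqrt(sum((x - mx)**2 for x in xs) / n)
    sy = math.sqrt(sum((y - my)**2 for y in ys) / n)
    return cov / (sx * sy)

xs = [1.0, 2.0, 3.0, 4.0]
print(correlation(xs, [2 * x + 1 for x in xs]))  # close to 1: y grows with x
print(correlation(xs, [-x for x in xs]))         # close to -1: y decreases with x
print(correlation(xs, [1.0, -1.0, -1.0, 1.0]))   # 0.0: no linear relation
```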

Another example can be drawn from physics: a magnetic system modelled by what is
called the Ising model. Within this model the magnetic system is represented by spins
(see the Quantum Mechanics course!) on a lattice. Each spin is a little vector that can
only point up or down and is hence represented by a variable $s$ that takes values $\pm 1$ (for
up and down, respectively). Each spin is labelled by an index $i$ that represents the site it
occupies on the lattice. If the lattice is cubic and $d$ dimensional one has $n = L^d$
spins in the system, with $L$ the linear length of the $d$ dimensional cube, and the set we want
to study is $X = \{s_1, \dots, s_n\}$. The magnetized state is represented by a configuration
such that the magnetization density, $m = n^{-1} \sum_{i=1}^{n} s_i$, takes a non-zero value while the
paramagnetic state is such that $m = 0$. Note that the magnetization density $m$ is just
the average of the set $X$, $m = \mu_X$. Now, one can imagine that the configuration evolves
in time, meaning that each spin changes its value as time evolves. One may then
be interested in comparing the configurations of the total system at different times. This
is achieved by computing the correlation (1.25), where $X$ is the set of $n$ spin values at
one time, say $t_1$, and $Y$ is the set of $n$ spin values at another time, say $t_2$. In other
words, $X = \{s_1(t_1), s_2(t_1), \dots, s_n(t_1)\}$, $Y = \{s_1(t_2), s_2(t_2), \dots, s_n(t_2)\}$ and

$$C_{XY} = \frac{1}{\sigma(t_1) \sigma(t_2)} \, \frac{1}{n} \sum_{i=1}^{n} (s_i(t_1) - m(t_1)) (s_i(t_2) - m(t_2)) . \qquad (1.26)$$

Covariance. The numerator in (1.25), i.e. the double integral without the factor $1/(\sigma_x \sigma_y)$, is called the covariance of $x$ and $y$.

1.5 Probability distributions 

In this Section we present some probability distributions that appear recurrently in the 
study of physical, biological, and sociological systems among others. 

The binomial 

A simple question one can pose now is: how many Hs do we expect after a finite number
$n$ of tosses? The binomial (or Bernoulli) distribution gives an answer to this question.
The probability of getting $h$ Hs after $n$ tosses of the coin is

$$P_n(h) = \binom{n}{h} p^h (1-p)^{n-h} ,$$

with $p$ the probability of getting an H in one toss, $1 - p$ the probability of getting a T in
one toss, and the combinatorial number defined as

$$\binom{n}{h} = \frac{n!}{(n-h)! \, h!} , \qquad n! = n (n-1) \cdots 1 .$$

A simple way of understanding this result is the following. Since the tosses are independent,
any given sequence with $h$ Hs and $n - h$ Ts occurs with probability $p^h (1-p)^{n-h}$, and the
combinatorial number counts the sequences that contain exactly $h$ Hs.

The form of the binomial distribution and its evolution with $n$ is shown in Figs. 9
($p = q = 1/2$) and 12 ($p \neq q$). Note that when $n$ increases the figures look more and
more peaked about their maximum, which occurs at $n/2$ when $p = q = 1/2$. This statement can be proven
rigorously.

With a simple calculation one proves

$$E(h) = np , \qquad \sigma_h^2 = np(1-p) \qquad (1.27)$$

(see Problem Set 1). Note that $rv_h = \sigma_h / E(h) \propto n^{-1/2}$ and this tends to 0 when
$n \to \infty$. This means that the distributions look more and more smooth and peaked when
$n$ increases.
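
The results (1.27) and the decay of the relative variability can be verified directly from the distribution. This is a minimal sketch using only the Python standard library; the function names are illustrative, and the direct evaluation is meant for moderate $n$ only (for much larger $n$ the combinatorial number no longer fits in a floating-point value).

```python
import math

def binomial_pmf(h, n, p):
    """Probability of h successes in n independent trials."""
    return math.comb(n, h) * p**h * (1 - p)**(n - h)

def binomial_mean_var(n, p):
    """Mean and variance computed by summing over the distribution."""
    pmf = [binomial_pmf(h, n, p) for h in range(n + 1)]
    mean = sum(h * w for h, w in enumerate(pmf))
    var = sum((h - mean)**2 * w for h, w in enumerate(pmf))
    return mean, var

# The last column is the relative variability, which decreases as n grows.
for n in (10, 100, 1000):
    mean, var = binomial_mean_var(n, 0.5)
    print(n, mean, var, math.sqrt(var) / mean)
```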

The binomial distribution is a 'two-parameter' one: $n$ and $p$ are the parameters controlling
its form.

To summarize, we have just studied a problem, the tossing of a coin, and we have
derived an equation that gives us the probability of occurrence of an event ($h$ Hs) in a
series of $n$ experiments. This formula is valid, obviously, only for this problem (and others
that can be mapped onto it). Even if it is just an example, it is illuminating and it allows
us to define the concept of probability.

A typical physical realization of the coin toss problem is the random walk problem.
Take a one dimensional lattice with spacing $a$ and a drunken walker that can occupy the
sites on this lattice. One knows that in each time step the walker moves right one half
of the times and moves left the other half. After $n$ time steps, the walker might have
taken $h$ steps to the right and $n - h$ steps to the left. $P_n(h)$ represents the probability
of this composed event. This rather simple problem has many applications in physics and
biology and it is at the basis of the theory of diffusion.

Figure 9: Evolution with $n$ of the binomial distribution function. Note how the form is
more and more continuous as $n$ increases. One sees from these plots that the location of
the maximum is at $\approx n/2$ while the width of the curve at half height is $\approx 2\sigma_h \approx \sqrt{n}$.

In general, the binomial distribution applies to problems that can be cast in the form
of a yes or no answer. For example, given a population of $n$ patients one can wonder what
is the probability that $h$ among them have asthma if one knows that the probability of
each patient having asthma is $p$. This is given by $P_n(h)$ with parameters $n$ and $p$.



The Poisson distribution

The distribution

$$P(x) = \frac{\mu^x}{x!} \, e^{-\mu} \qquad (1.28)$$

with $\mu$ a parameter and $x$ a discrete random variable, taking values $x = 0, 1, 2, \dots$, is
called the Poisson probability distribution function (pdf). See Fig. 10.

With a simple calculation one proves that the parameter $\mu$ is the expected value of the
random variable, $E(x) = \mu$, and that $\sigma_x^2 = \mu$.

The Poisson pdf can be obtained as the limit of the binomial distribution when $n$ is very
large, $p$ is small and $np = \mu$ is kept fixed.

Two examples of random variables described by the Poisson distribution are the following.

Take a gas confined to a volume $V$ with average density $\rho$. If one divides the total
volume into small boxes with equal volume $v$, the local density fluctuates from box to
box. The number of particles in each little volume $v$ is described by a Poisson distribution.

Another example is the local connectivity in the so-called Erdős-Rényi
random graph with average connectivity $c$. The number of links reaching a vertex fluctuates
according to a Poisson distribution with parameter $\mu = c$.

The normal or Gaussian distribution

The function

$$p(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \, e^{-\frac{(x-\mu)^2}{2\sigma^2}} , \qquad (1.29)$$

where $x$ takes all values on the real axis and $\mu$ and $\sigma^2$ are two parameters, represents the
Gaussian probability density. One easily checks, by direct integration, that the expected
value of $x$, $E(x)$, equals $\mu$ and its variance, $\sigma_x^2$, equals $\sigma^2$. See Fig. 11.

The Gaussian probability density can also be viewed as a limit of the binomial distribution.
In this case one keeps $p$ finite and takes $n$ to infinity.

The Gaussian distribution in its normal form has zero mean and unit variance. Given
a generic Gaussian distribution, the normal form is achieved by defining $y = (x - \mu)/\sigma$
and transforming $p(x)$ into $p(y)$:

$$p(y) = \frac{1}{\sqrt{2\pi}} \, e^{-\frac{y^2}{2}} . \qquad (1.30)$$

The cumulative probabilities of a random variable with a Gaussian distribution are
tabulated or can be computed numerically. They are of great use in Inferential Statistics.

Approximating the binomial 

The calculation of the Poisson and Gaussian distributions is much simpler than the
calculation of the binomial. Of course, these distributions are not identical.

Figure 10: Poisson distribution for three values of $\mu$ given in the key.

Figure 11: Gaussian probability density. Left: fixed $\sigma^2 = 1$ and three values of $\mu$ given
in the key. Right: fixed $\mu = 0$ and three values of $\sigma$ given in the key.

A rule of thumb tells that the Poissonian approximation to the binomial is rather
accurate when

$$n \geq 20 , \qquad p \leq 0.05 . \qquad (1.31)$$

Let us check this statement with an example. Take a binomial distribution with $n = 70$
and $p = 0.02$. The probability of three successes is

$$P(h = 3, n = 70) = \frac{70!}{67! \, 3!} \, 0.02^3 \, (0.98)^{67} = 0.1131 , \qquad (1.32)$$

while the Poissonian approximation is

$$P(\mu = np = 1.4, x = 3) = \frac{1.4^3}{3!} \, e^{-1.4} = 0.1128 , \qquad (1.33)$$

a very good result.
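
The comparison between the binomial and its Poissonian approximation can be reproduced with a few lines of code. The sketch below uses only standard-library functions; the helper names `binomial_pmf` and `poisson_pmf` are illustrative.

```python
import math

def binomial_pmf(h, n, p):
    """Exact binomial probability of h successes in n trials."""
    return math.comb(n, h) * p**h * (1 - p)**(n - h)

def poisson_pmf(x, mu):
    """Poisson probability (1.28) with parameter mu."""
    return mu**x * math.exp(-mu) / math.factorial(x)

n, p, h = 70, 0.02, 3
exact = binomial_pmf(h, n, p)
approx = poisson_pmf(h, n * p)  # Poisson parameter mu = n p = 1.4
print(round(exact, 4), round(approx, 4), abs(exact - approx) / exact)
```

Changing `n` and `p` while keeping `n * p` fixed shows how the relative difference shrinks as the conditions (1.31) are better satisfied.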

The binomial distribution is discrete while the Gaussian is continuous. To approximate
the former by the latter one has to be precise about what one really means. Thus, the binomial
probability of having $h$ will be approximated by the Gaussian cumulative probability of
finding the continuous variable $x$ within the interval $[h - 0.5, h + 0.5]$:

$$P(h, n) \approx \int_{h-0.5}^{h+0.5} dx \, p(x) . \qquad (1.34)$$


Figure 12: The binomial compared to the Gaussian.

Let us test this hypothesis with an example. Suppose that 15% of the cars coming out of
an assembly plant have some defect. In a delivery of 40 cars what is the probability that
exactly five cars have defects? The actual answer is given by the binomial formula

$$\frac{40!}{35! \, 5!} \, 0.15^5 \, 0.85^{35} = 0.1692 . \qquad (1.35)$$

But the calculation of the factorials is quite complicated. What about the Gaussian
approximation to this result? The mean and standard deviation of the binomial are $\mu = np =
40 \times 0.15 = 6$ and $\sigma = \sqrt{np(1-p)} = \sqrt{40 \times 0.15 \times 0.85} = 2.258$. Thus, the Gaussian
approximation is

$$\int_{4.5}^{5.5} dx \, \frac{1}{\sqrt{2\pi \, 2.258^2}} \, e^{-\frac{(x-6)^2}{2 \times 2.258^2}} = 0.1583 \qquad (1.36)$$

and, again, this result is quite close to the exact one. Note that in this case, $p = 0.15$ is not
small (it is not smaller than 0.05 as in the above example where we used the Poissonian
approximation).
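
The Gaussian approximation with the half-integer interval of (1.34) can be evaluated with the error function instead of tables. This is a hedged sketch: `normal_cdf` is a hypothetical helper that returns the standard cumulative probability $P(X \leq x)$, and a direct evaluation gives about 0.159, slightly different from the value 0.1583 quoted in the text, presumably because tabulated values are read at rounded arguments.

```python
import math

def binomial_pmf(h, n, p):
    """Exact binomial probability of h successes in n trials."""
    return math.comb(n, h) * p**h * (1 - p)**(n - h)

def normal_cdf(x, mu, sigma):
    """P(X <= x) for a Gaussian variable, written with the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def gaussian_approx(h, n, p):
    """Gaussian probability of the interval [h - 0.5, h + 0.5], as in (1.34)."""
    mu = n * p
    sigma = math.sqrt(n * p * (1 - p))
    return normal_cdf(h + 0.5, mu, sigma) - normal_cdf(h - 0.5, mu, sigma)

# Cars example: n = 40 cars, p = 0.15, exactly h = 5 defective cars.
print(binomial_pmf(5, 40, 0.15), gaussian_approx(5, 40, 0.15))
```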

Lévy distribution

Lévy distributions play an important role in the description of a number of statistical
problems of current interest (finance, anomalous diffusion, etc). They are pdfs that fall
off at infinity as a power law:

$$L_\mu(x) \sim \frac{\mu A_\pm^\mu}{|x|^{1+\mu}} , \qquad x \to \pm\infty . \qquad (1.37)$$

$\mu$ is a positive parameter (to ensure the normalization to one of the pdf). The $n$-th moment
diverges as soon as $n \geq \mu$. In particular, if $0 < \mu < 2$ then the variance diverges.





1.6 The law of large numbers

We have already mentioned that if we toss a coin a large number of times $n$, the number of
Hs found will be close to $n/2$. This result is a particular form of the law of large numbers.

Take a random variable $x$ and measure it $n$ times, i.e. draw the numbers $x_1, x_2, \dots, x_n$.
Construct the average of these numbers, as done in Sect. 3:

$$\mu_x = \frac{1}{n} \sum_{i=1}^{n} x_i . \qquad (1.38)$$

The law of large numbers tells us that the probability for this value to differ from
the expected value of $x$, $E(x)$, by more than any fixed $\epsilon > 0$ tends to zero when $n \to \infty$:

$$P(|E(x) - \mu_x| > \epsilon) \to 0 \quad \text{for} \quad n \to \infty . \qquad (1.39)$$

This statement can be rigorously proven. We shall not discuss the proof here; it can
be found in the literature.
1.7 The central limit theorem

The central limit theorem states that in the large $n$ limit the random variable constructed
with the normalised sum of independent, identically distributed (iid) random variables,

$$y = \frac{1}{n} \sum_{i=1}^{n} x_i , \qquad (1.40)$$

has a Gaussian distribution with expectation value equal to the expectation value of
the original random variable, $E(x)$, and variance given by $\sigma_x^2/n$. In other words, $y$ is
a continuous random variable (even if $x$ might have been a discrete one) distributed
according to

$$p(y) = \frac{1}{\sqrt{2\pi\sigma_y^2}} \, e^{-\frac{(y - E(x))^2}{2\sigma_y^2}} = \sqrt{\frac{n}{2\pi\sigma_x^2}} \, e^{-\frac{n (y - E(x))^2}{2\sigma_x^2}} . \qquad (1.41)$$

Since $y$ is a sum over $n$ random variables, divided by $n$, its expectation value is

$$E(y) = \frac{1}{n} \sum_{i=1}^{n} E(x) = E(x) , \qquad (1.42)$$

i.e. it is equal to the expectation value of the variables $x_i$, which are supposed to be identically
distributed.

What about the variance of the random variable $y$? We have shown that the variance
of a random variable $z$ built with the sum of two independent random variables, $x$ and $y$, is equal to
the sum of the variances, $\sigma_z^2 = \sigma_x^2 + \sigma_y^2$ for $z = x + y$. Now, in our case, each element in
the sum is equal to $x/n$. And we also showed that the variance of $x/n$, $\sigma_{x/n}^2$, equals the
variance of $x$ divided by $n^2$: $\sigma_{x/n}^2 = \sigma_x^2/n^2$. Hence,

$$\sigma_y^2 = \sigma_{x/n}^2 + \dots + \sigma_{x/n}^2 = n \, \sigma_{x/n}^2 = n \, \sigma_x^2 / n^2 = \sigma_x^2 / n . \qquad (1.43)$$

Note the importance of this theorem for designing experiments. An experiment consists
of the measurement of an observable. A single measurement does not make sense
since the result found will be subject to many sources of noise. The result of an experiment
only has a statistical sense. If one repeats the measurement $n$ times and constructs
the average $y$ of the results, $x_1, \dots, x_n$, the central limit theorem ensures that $y$ will
be a normally distributed random variable with expected value $E(x)$ and variance $\sigma_x^2/n$.
Increasing the number $n$ one then reduces the width of the Gaussian and for sufficiently
large $n$ one is sure to approach the actual expected value of the observable $x$ with the
average of the data, $y$. We shall come back to this issue below.
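
The reduction of the variance of the average as $1/n$ can be observed in a small numerical experiment. The sketch below assumes, purely for illustration, that each measurement is a uniform random number in $[0, 1)$, for which $E(x) = 1/2$ and $\sigma_x^2 = 1/12$; the function name `sample_means` and the seed are hypothetical choices.

```python
import random
import statistics

def sample_means(n, trials, seed=1):
    """Draw `trials` averages y, each over n uniform random numbers in [0, 1)."""
    rng = random.Random(seed)
    return [sum(rng.random() for _ in range(n)) / n for _ in range(trials)]

if __name__ == "__main__":
    var_x = 1.0 / 12.0  # variance of a single uniform variable on [0, 1)
    for n in (1, 10, 100):
        ys = sample_means(n, 5000)
        # Columns: n, measured mean of y, measured variance of y, predicted var_x / n.
        print(n, statistics.mean(ys), statistics.pvariance(ys), var_x / n)
```

A histogram of `ys` for increasing `n` would show the approach to the Gaussian shape; only the first two moments are compared here.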

Generalizations in which the variables $x_i$ are not identically distributed (but have finite
expectation value and variance) are possible but we shall not discuss them here.

In the case in which the variables to be added are still iid but have a probability
distribution with a tail that falls as a power law with $\mu < 2$, as in the Lévy case with
amplitudes $A_\pm$, the limit law is a symmetric Lévy law (if the right and left tails of $p(x)$ have different
amplitudes one obtains an asymmetric limit law with amplitudes that are a combination
of the original ones).
2 Some useful formulae

Stirling formula for the factorial of a large number reads:

$$\ln N! \sim N \ln N - N , \qquad \text{for} \quad N \gg 1 . \qquad (2.1)$$
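
The quality of the leading-order Stirling approximation, $\ln N! \simeq N \ln N - N$, can be checked against the exact value obtained from the log-gamma function. This is a minimal sketch; the function names are illustrative.

```python
import math

def ln_factorial(n):
    """Exact ln(n!) via the log-gamma function, ln(n!) = lgamma(n + 1)."""
    return math.lgamma(n + 1)

def stirling(n):
    """Leading-order Stirling approximation, n ln n - n."""
    return n * math.log(n) - n

# Columns: N, exact ln N!, approximation, relative error.
for n in (10, 100, 1000, 10**6):
    exact = ln_factorial(n)
    print(n, exact, stirling(n), (exact - stirling(n)) / exact)
```

The relative error decreases with $N$ because the neglected terms grow only logarithmically.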



3 The saddle-point method

Imagine one has to compute the following integral

$$I = \int_a^b dx \, e^{-N f(x)} , \qquad (3.1)$$

with $f(x)$ a positive definite function in the interval $[a, b]$, in the limit $N \to \infty$. It is clear
that due to the rapid exponential decay of the integrand, the integral will be dominated
by the minimum of the function $f$ in the interval. Assuming there is only one absolute
minimum, $x_0$, one then Taylor expands $f(x)$ up to second order,

$$f(x) \simeq f(x_0) + \frac{1}{2} f''(x_0) (x - x_0)^2 , \qquad (3.2)$$

and obtains

$$I \sim e^{-N f(x_0)} \int_a^b dx \, e^{-\frac{N}{2} f''(x_0)(x - x_0)^2} = e^{-N f(x_0)} \, [N f''(x_0)]^{-1/2} \int_{y_a}^{y_b} dy \, e^{-\frac{1}{2}(y - y_0)^2} , \qquad (3.3)$$

with $y_0 \equiv \sqrt{N f''(x_0)} \, x_0$ and similarly for $y_a$ and $y_b$. The Gaussian integral is just an error
function that one can find in tables.

This argument can be extended to multidimensional integrals, cases in which there is
no absolute minimum within the integration interval, cases in which the function $f$ is not
positive definite, etc.
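
A quick numerical test helps to see how fast the saddle-point estimate becomes accurate. The sketch below assumes the illustrative choice $f(x) = \cosh x$ on $[-1, 1]$, which has its minimum at $x_0 = 0$ with $f(x_0) = f''(x_0) = 1$; the Gaussian integral is evaluated with its limits sent to infinity, which gives $\sqrt{2\pi / (N f''(x_0))}$. All names are hypothetical.

```python
import math

def f(x):
    return math.cosh(x)  # illustrative choice: x0 = 0, f(x0) = 1, f''(x0) = 1

def reduced_integral(N, a=-1.0, b=1.0, steps=20000):
    """Trapezoidal estimate of e^{N f(x0)} times the integral of e^{-N f(x)} on [a, b].

    The factor e^{-N f(x0)} is taken out to avoid numerical underflow at large N.
    """
    dx = (b - a) / steps
    total = 0.0
    for i in range(steps + 1):
        x = a + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-N * (f(x) - 1.0))
    return total * dx

def saddle_point(N):
    """Gaussian estimate of the same quantity, with the limits sent to infinity."""
    return math.sqrt(2.0 * math.pi / N)  # uses f''(x0) = 1

# The ratio approaches 1 as N grows.
for N in (10, 100, 1000):
    print(N, reduced_integral(N) / saddle_point(N))
```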

4 The Fokker-Planck equation

Let us show how to obtain the above Fokker-Planck equation from the Langevin equation.
The following derivation encompasses the multiplicative noise case; this allows one to see
how the difference between the Itô convention and the Stratonovich convention, discussed
in Sect. ??, translates onto the evolution of the probability density. For simplicity, we
will only consider here the one dimensional Smoluchowski case.

Suppose first that one works with the Itô convention. Then, an arbitrary function $G(x)$
of the random variable $x$ that does not depend explicitly on time evolves according
to

$$dG = \frac{\partial G}{\partial x} \, dx + \frac{T}{\gamma} \, e^2(x) \frac{\partial^2 G}{\partial x^2} \, dt = \frac{\partial G}{\partial x} \, \frac{1}{\gamma} [F(x) + e(x) \xi(t)] \, dt + \frac{T}{\gamma} \, e^2(x) \frac{\partial^2 G}{\partial x^2} \, dt , \qquad (4.4)$$

where $e(x)$ is the function multiplying the noise term $\xi$ in the Langevin equation (cf.
eqn ??). Averaging the above equation over the thermal noise leads to

$$\langle G(x(t+dt)) - G(x(t)) \rangle = \frac{dt}{\gamma} \left\langle \frac{\partial G}{\partial x} F(x) \right\rangle + \frac{T \, dt}{\gamma} \left\langle e^2(x) \frac{\partial^2 G}{\partial x^2} \right\rangle , \qquad (4.5)$$

where we have used the fact that $x(t)$ and $\xi(t)$ are independent in the Itô convention.
Now, as an identity, any average $\langle O(x) \rangle$ of a function $O(x)$ can be written in terms of the
probability density $P(x,t)$ as

$$\langle O(x) \rangle = \int O(x) P(x,t) \, dx . \qquad (4.6)$$

Hence, eqn (4.5) is equivalent to

$$\int dx \, G(x) \frac{\partial P(x,t)}{\partial t} = \frac{1}{\gamma} \int dx \, \frac{\partial G}{\partial x} F(x) P(x,t) + \frac{T}{\gamma} \int dx \, e^2(x) \frac{\partial^2 G}{\partial x^2} P(x,t) .$$

After integrating by parts and noticing that the resulting equation holds for an arbitrary
function $G(x)$, one finally finds:

$$\gamma \frac{\partial P(x,t)}{\partial t} = - \frac{\partial}{\partial x} [F(x) P(x,t)] + T \frac{\partial^2}{\partial x^2} [e^2(x) P(x,t)] , \qquad (4.7)$$

which is identical to the Smoluchowski equation (6.16) with the identification $F(x) =
-dV/dx$ and $e(x) = 1$.

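If we now consider the Stratonovich prescription, the starting point is simply

$$dG = \frac{\partial G}{\partial x} \, dx = \frac{\partial G}{\partial x} \, \frac{1}{\gamma} [F(x) + e(x) \xi(t)] \, dt , \qquad (4.8)$$

but now one cannot assume that $\langle O(x(t)) \xi(t) \rangle = 0$ when averaging the above equation
over the noise $\xi$. Rather, one has, using Novikov's formula in a discrete time setting:

$$\langle O(x_k) \xi_k \rangle = \sum_l \left\langle O'(x_k) \, \frac{1}{2\gamma} [e(x_k) \delta_{k,l} + e(x_{k-1}) \delta_{k-1,l}] \right\rangle 2 \gamma T \delta_{k,l} . \qquad (4.9)$$

Therefore, taking $O(x) = e(x) \, \partial G / \partial x$, one finally obtains

$$\langle dG \rangle = \frac{dt}{\gamma} \left\langle \frac{\partial G}{\partial x} F(x) \right\rangle + \frac{T \, dt}{\gamma} \left\langle e(x) \frac{\partial}{\partial x} \left[ e(x) \frac{\partial G}{\partial x} \right] \right\rangle . \qquad (4.10)$$

A manipulation of the above then allows one to establish the following evolution equation
for the probability density:

$$\gamma \frac{\partial P(x,t)}{\partial t} = - \frac{\partial}{\partial x} [F(x) P(x,t)] + T \frac{\partial}{\partial x} \left\{ e(x) \frac{\partial}{\partial x} [e(x) P(x,t)] \right\} . \qquad (4.11)$$

Note the difference with the above equation established in the Itô framework: the function
$e(x)$ appears differently in the diffusion term. It can easily be checked that this equation
is equivalent to the Itô case provided the true force $F(x)$ is replaced by an effective force
given by $F(x) - T e(x) e'(x)$. Hence, the different conventions lead in general to different
results, because they describe different microscopic processes. The two Fokker-Planck
equations become the same in the additive noise case $e(x) = 1$.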
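
In the additive noise case the link between the Langevin and Fokker-Planck descriptions can be tested numerically. The sketch below is written under several assumptions that are not stated explicitly in this section: the Langevin equation is taken as $\gamma \, dx/dt = F(x) + \xi(t)$ with $\langle \xi(t) \xi(t') \rangle = 2 \gamma T \delta(t - t')$, the force derives from a harmonic potential $V(x) = k x^2 / 2$, and a simple Euler discretisation is used. With these choices the stationary solution of the Fokker-Planck equation is $P(x) \propto e^{-V(x)/T}$, so the variance of $x$ should approach $T/k$. All function and parameter names are hypothetical.

```python
import math
import random

def simulate_ensemble(k=1.0, T=0.5, gamma=1.0, dt=0.0125, steps=400,
                      walkers=3000, seed=2):
    """Euler integration of gamma dx/dt = -k x + xi(t) for independent walkers.

    Assumes <xi(t) xi(t')> = 2 gamma T delta(t - t') and additive noise, e(x) = 1.
    Returns the positions of all walkers at the final time steps * dt.
    """
    rng = random.Random(seed)
    noise_amp = math.sqrt(2.0 * T * dt / gamma)  # size of the noise kick per step
    finals = []
    for _ in range(walkers):
        x = 0.0
        for _ in range(steps):
            x += -k * x * dt / gamma + noise_amp * rng.gauss(0.0, 1.0)
        finals.append(x)
    return finals

def variance(values):
    m = sum(values) / len(values)
    return sum((v - m)**2 for v in values) / len(values)

if __name__ == "__main__":
    xs = simulate_ensemble()
    print(variance(xs), "to be compared with T / k =", 0.5 / 1.0)
```

An ensemble of independent walkers is used, rather than one long trajectory, so that the samples entering the variance are statistically independent.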
Statistical Mechanics TD 1:
Basic notions

The aim of this TD is to review basic notions of probability and statistical mechanics.

1 Probability

Consider the function $f(x) = a x e^{-\alpha x}$, defined for real and positive values of $x$.

1. What conditions must the parameters $a$ and $\alpha$ satisfy for this
function to be a probability density?

2. Compute its mean, by introducing a source, and the location of its maximum.
Are these two values the same?

3. What can be said about the symmetry properties of this probability density?

4. Compute the standard deviation, $\sigma$.

5. Relate the expectation values of $x$ and of $x + A$, with $A$ a constant.

6. Relate the expectation values of $x$ and of $A x$, with $A$ a constant.

2 Paramagnetism

Consider a set of $N$ spins, $s_i$ with $i = 1, \dots, N$, in the presence of an external magnetic field.
The energy of the set is given by

$$E = -h \sum_{i=1}^{N} s_i . \qquad (1)$$

The set is in contact with a reservoir at temperature $T$.

1. Compute the partition function and the free energy per spin.

2. Compute the magnetization density of the system.

3. Compute the magnetic susceptibility.

4. Identify a natural parameter for this system. Plot the free-energy density,
the magnetization density of the system, and its magnetic susceptibility as functions of
this parameter. Discuss.
3 Fluctuation-dissipation theorem

Take a system characterised by an energy $E$ in thermal equilibrium with a reservoir
at temperature $T$.

1. Prove the fluctuation-dissipation theorem

$$\left. \frac{\partial \langle A \rangle_h}{\partial h} \right|_{h=0} = \beta \langle (A - \langle A \rangle)^2 \rangle , \qquad (2)$$

where $h$ is a field that couples linearly to $A$:

$$E \to E - h A . \qquad (3)$$

This result does not depend on the system considered.

4 Ising model in one dimension

We study the Ising model in which the spins, $s_i = \pm 1$, are placed on a ring of length
$L$ (i.e. with periodic boundary conditions), without external magnetic field.

1. Compute the free energy. Hint: define the link variables, $\eta_i = s_i s_{i+1}$.

2. Compute the correlation function $G(\vec{r}_i - \vec{r}_j) = \langle s_i s_j \rangle - \langle s_i \rangle \langle s_j \rangle$. Hint: compute the
correlation function $G(\vec{r}_i - \vec{r}_{i+1})$ as the variation of the free energy with respect
to the coupling constant $J_i$ between the spins $s_i$ and $s_{i+1}$ (set all the $J_i = J$ at the
end of the calculation).
Statistical Mechanics TD 2:
Models with infinite-range interactions

1 The Ising model

Consider the Ising model with infinite-range ferromagnetic interactions
among $p$ spins: the coupling constant $J_{i_1 \dots i_p}$ takes the value $J_{i_1 \dots i_p} = J_0$ for all groups of
$p$ spins:

$$H = -J_0 \sum_{(i_1, \dots, i_p)} s_{i_1} \dots s_{i_p} - h \sum_i s_i . \qquad (1)$$

1. How must $J_0$ and $h$ behave as functions of $N$ for the model to be
well defined in the thermodynamic limit?

2. Compute the mean energy and entropy for the case $h = 0$ in the limits $T = 0$
and $T \to \infty$. What can one remark?

3. Write the partition function of the system as a sum over the different
values of the magnetization per spin $m$. Define a free-energy density at fixed $m$.

4. Using the saddle-point method, find an equation for the magnetization per spin,
$m$.

5. In the case $p = 2$ compare with the equation obtained with the Weiss approximation
(mean field) for the Ising model with nearest-neighbour interactions
on a square lattice in dimension $d$. What can one then conclude about the mean-field approximation
for the Ising model with infinite-range interactions?

6. Study graphically the equation for the magnetization per spin $m$ with and without
magnetic field. Distinguish the cases $p = 2$ and $p > 2$.

7. Same for the free energy per spin $f(m)$.

8. In the case $p = 2$, study the magnetization close to the transition and determine the
critical exponents $\beta$, $\gamma$ and $\delta$.
Statistical Mechanics TD 3:
Disordered systems

1 Frustration

1. Consider three spins, $s_1$, $s_2$ and $s_3$, with energy function

$$E = -s_1 s_2 - s_1 s_3 - s_2 s_3 . \qquad (1)$$

Compute the partition function, $Z$, the mean magnetization of each spin, $m_i =
\langle s_i \rangle$, with $i = 1, 2, 3$, as well as the mean value of the energy, $\langle E \rangle$. Compute
the mean energy and entropy at zero temperature.

2. Repeat these calculations for the energy

$$E = -s_1 s_2 - s_1 s_3 + s_2 s_3 . \qquad (2)$$

3. What do you observe?

2 Disorder

1. Consider two spins $s_1$ and $s_2$ coupled according to

$$E = -J s_1 s_2 . \qquad (3)$$

Compute the partition function, the mean magnetization of each spin and
the mean energy as functions of $J$.

2. Now take $J$ random, distributed according to a continuous probability density,
$p(J) = 1/(J_+ - J_-)$ for $J_- < J < J_+$ and $p(J) = 0$ otherwise. Compute the disorder average
of the magnetizations, $[m_1] = [\langle s_1 \rangle]$ and $[m_2] = [\langle s_2 \rangle]$, and of the energy,
$[\langle E \rangle]$. What is the value of the mean ground-state energy?

3 Self-averaging property

1. Consider a chain of spins with random couplings:

$$E = -\sum_{i=1}^{N} J_i s_i s_{i+1} . \qquad (4)$$

2. Compute the partition function $Z_J$.

3. Compute the mean local magnetization $m_i = \langle s_i \rangle$ for a given realization of the
disorder. Is it invariant under translations?

4. Compute the free energy $F_J$. What is the probability distribution of $F_J$?

5. Show that $F_J$ is a self-averaging quantity.

6. Is $Z_J$ self-averaging?

7. Compute the correlation function $\langle s_i s_j \rangle$ and discuss its properties.

4 The Mattis magnet

Show that the Mattis model is not really frustrated.
Statistical Mechanics TD 4:
Stochastic dynamics

Statistical Mechanics TD 5:
Surface growth phenomena